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Preface 


The purpose of this book is to give an easily accessible introduction to the finite 
element method as a general method for the numerical solution of partial 
differential equations in mechanics and physics covering all the three main 
types of equations, namely elliptic, parabolic and hyperbolic equations. The 
main part of the text is concerned with linear problems, but a chapter 
indicating extensions to some nonlinear problems is also included. There is 
also a chapter 00 finite element methods for integral equations connected with 
elliptic problems. The book is based on material that 1 have used in 
undergraduate courses at Chalmers University of Technology, Goteborg. The 
first half of the book (Chapters | -7), which treat elliptic problems in a rather 
standard way, is a translation of a textbook in Swedish that appeared 198! 
[J1]. Two chapters on parabolic and hyperbolic problems present recent 
developments based on my work on discontinuous Galerkin and streamline 
diffusion type finite element methods using, in particular, finite elements for 
the time discretization as well. For first order hyperbolic problems these are 
the first finite element methods with satisfactory properties and thus show 
promise of extensive applications. For parabolic problems, time-discretization 
by the discontinuous Galerkin method gives new efficient methods and makes 
a precise error analysis with associated automatic time step control possible 
for the first time. 

‘The emphasis of the text is on mathematical and numerical aspects of the 
finite element method but many applications to important problems in 
mechanics and physics are also given. | have tried to keep the mathematics 
as simple as possible while still presenting significant results and maintaining 
a natural mathematical framework. Lately I have used the text of the book 
as part of the material in a series of undergraduate courses on partial 
differential equations leading up to graduate level treating in integrated form 
both mathematical questions on existence and regularity together with 
numerical methods. I have found this to be a fruitful approach where, on one 
hand the numerical methods can be given the necessary mathematical 
background, and on the other hand, the fascinating and important techniques 
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of solving differential equations numerically usmg computers can give crucial 
motivation for the theoretical mathematical studies. In fact. the numerical and 
mathematical aspects are intimately connected and the new possibilities of 
computer simulation makes proper understanding of the mathematical 
structure and propertics of the mathematical models very important also in 
applications. In the present book only a bare minimum of mathematical 
background is included and the reader is referred to the literature for a more 
complete account. 

The list of references is limited and many contributions important for the 
development of the various subjects have been omilted. I have given just a 
few references leading into the very rich literature on finite element methods. 

I want to thank Prof Raymond Chandler for revising the English. Tekn Lic 
Peter Hansbo for supplying most of the numerical results and Dr Kenneth 
Eriksson for reading parts of the material. Special thanks also to Mrs Yumi 
Karlsson who swiftly typed a first version of the text and with great patience 
helped me with seemingly endless alterations and corrections. 


Goteborg in July 1987 


Claes Johnson 


0. Introduction 


0.1 Background 


The mathematical models of science and engineering mainly take the torm 
of differential or integral equations. With the rapid development of high speed 
computers over the last decades the possibilities of efficiently utilizing these 
models have dramatically increased. Using computer-implemented math- 
ematical models, one can simulate and analyze complicated systems in 
engineering and science. This reduces the need for expensive and time- 
consuming experimental testing and makes it possible to compare many 
different alternatives for optimization, ete. In fact, with the new possibilitics 
an intense activity has started in Computer Aided Design, Engineering and 
Manufacturing (CAD, CAE and CAM) which is bringing revolutionary 
changes to engineering science and practice, and a new scientific field 
“scientific computing” is emerging as a complement to theoretical and 
experimental science. 

To use mathematical models on a computer one needs numerical methods 
Only in the very simplest cases is it possible to find exact analytical solutions 
of the equations in the model, and in general one has to rely on numerical 
techniques for finding approximate solutions. The finite element method 
(FEM) is a general technique for numerical solution of differential and integral 
equations in science and engineering. The method was introduced by 
engineers in the Jate 50’s and early 60's for the numerical solution of partial 
differential equations in structural engineering (clasticity equations, plate 
equations, etc). At this point the method was thought of as a generalization 
of earlier methuds in structural engineering for beams, frames and plates, 
where the structure was subdivided into small parts, so-called finite elements, 
with known simple behaviour. When the mathematical study of the finite 
element method started in the mid 60's it soon became clear that in fact the 
method is a genera] technique for numerical solution of partial differential 
equations with roots in the variational methods in mathematics introduced in 
the beginning of the century. During the 60's and 70’s the method was 
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developed, by engineers, mathematicians and numerical analysts, into a 
general method for numerical solution of partial differential equations and 
integral equations with applications in many areas of science and engineering. 
Today, finite element methods are used extensively (often integrated in CAD 
or CAE-systems) for problems in structural engineering, strength of matexials, 
fluid mechanics. nuclear engineering, electro-magnetism, wave-propagation, 
scattering, heat conduction. convection-diffusion processes, integrated cir- 
cuits, petroleum engineering, reaction-diffusion processes and many other 
areas. 


0.2 Difference methods — Finite element methods 


The basic idea in any numerical method for a differentia! equation is to 
discretize the given continuous problem with infinitely many degrees of 
freedom to obtain a discrete problem or system of equations with only finitely 
many unknowns that may be solved using a computer. The classical numerical 
method for partial differential equations is the difference method where the 
discrete problem is obtained by replacing derivatives with difference quotients 
involving the values of the unknown at certain (finitely many) points. 

The discretization process using a finite element method is different. In this 
case we start from a reformulation ot the given differential equation as an 
equivalent variational problem. In the casc of clliptic equations this variational 
problem in basic cases is a minimization problem of the form = 


(M) Find ueV such that F(u)<F(v) for all ve V. 


where V is a given set of admissible tunctions and F:V- R is a functional (je 
F(v)eR for all veV with R denoting the set of real numbers), The functions 
vin V often represent a continuously varying quantity such as a displacement 
in an elastic body, a temperature, etc, F(v) is the total energy associated with 
v and (M) corresponds to an equivalent characterization of the solution of the 
differential equation as the function in V that minimizes the total energy of 
the considered system. In general the dimension of V is infinite (ie the 
functions in V cannot be described by a finite number of parameters) and thus 
in general the problem (M) cannot be solved exactly. To obtain a problem 
that can be solved on a computer the idea in the finite clement method is to 
replace V by a set V}, consisting of simple functions only depending on finitely 
many parameters, This leads to a finitc-dimensional minimization problem 
of the form: 
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(Mn) Find une Vp such that (uy) F(y) for all ve Vi. 


This problem is equivalent toa (large) linear or nonlinear system of equations. 
The hope is now that the solution up of this problem is a sufficiently good 
approximation of the solution u of the original minimization problem (M), 
ic, the original partial differential equation. Usually onc chooses Vp to be a 
subset of V (in other words VycV, ie, if veV, then veV) and in this case 
(My) corresponds to the classical Ritz-Galerkin method that goes back lo the 
beginning of the ccntury. The special feature of a finite element method as 
a particular Ritz-Galerkin method is the fact that the functions in Vy are 
chosen to be piecewise polynomial. As will be seen below, onc may also start 
from more general variational formulations than the minimization problem 
(M); this corresponds eg to so-called Galerkin methods. 

To solve a given differential or integral equation approximately using the 
finite element method, one has to go through basically the following steps: 


(i) variational formulation of the given problem, 

(ii) discretization using FEM: construction of the finite dimensional space 
Vhs 

(iii) solution of the discrete problem, 

(iv) implementation of the method on a computer: programming. 


Often there are several different variational formulations that may be used 
depending eg on the choice of dependent variables. The choice of finite 
dimensional subspace Vp, ic, essentially the choice of the finite element 
discretization or the finite elements, is influenced by the variational formu- 
lation, accuracy requirements, regularity properties of the exact solution etc. 
To solve the discrete problem one needs optimization algorithms and/or 
methods for the numerical solution of large linear or nonlinear systems of 
equations. In this book we shall consider all the steps (i) -Gv) with (iv) kept 
at an introductory level. 

The advantage of finite element methods as compared with finite difference 
methods is that complicated geometry, general boundary conditions and 
variable or non-linear material properties can be handled relatively casily. In 
all these cases one meets unneccessary artificial complications with finite 
difference methodology. Further, the clear structure and versatility of the 
finite element method makes it possible to construct general purpose software 
for applications and there is also a large number of morc or less general finite 
element codes available. Also, the finite element method has a solid 
theoretical foundation which gives added reliability and in many cases makes 
it possible to mathematically analyze and estimate the error in the approxi- 
mate finite element solution. 


lL 


0.3 Scope of the book 


The purpose of this book is to give an introduction to the tinite element 
method as a genera] technique for the numerical solution of partial differential 
and integral equations in science and engineering. The focus is on mathemat- 
ical and numerical properties of the method, but we also consider many 
important applications to problems from various areas. An cffort has becn 
made to keep the mathematics simple while still presenting significant results 
and considering non-trivial problems of practical interest. 

We will consider the three main types of partical differential cquations, ic, 
elliptic, parabolic and hyperbolic equations. To connect these types of 
equations with problems in mechanics and physics, wo recall that elliptic 
equations model for example static problems in elasticity, parabolic equations 
mode! time-dependent diffusion dominated processes, and hyperbolic 
equations are used to describe convection or wave-propagation processes. We 
also give an introduction 1o finite element methods for boundary integral 
equations associated with elliptic boundary valuc problems in mechanics and 
physics. These methods are referred to as boundary element methods (BEM). 
We will mainly consider linear problems and only briefly comment on some 
non-linear ones. 

The material presented concerning elliptic problems is by naw standard, 
but for parabolic and hyperbolic problems we present recent developments 
that have not earlier appeared in text books. With these later contributions 
it is possible to give a unified treatment of the three main types of partial 
differential equations as well as boundary integral equations. In all cases we 
emphasize the basic role played by the stability properties of the finite element 
method and the relation to the corresponding properties of the partial 
differential or integral equation. 

The buok is an extended version of an carlier book in Swedish by the author 
that has been used for several ycars in undergraduate courses for engineering 
students at Chalmers University of Technology, Goteborg, Sweden, and also 
at other Scandinavian universities. 

The necessary prerequisites are relatively modcrate: Basic courscs in ad- 
vanced calculus and linear algebra and preferably some acquaintance with the 
most well-known Jincar partial differential equations in mechanics and 
physics. such as the Poisson equation, the heat equation and the wave 
equation. With some oversimplification we may say that the mathematical 
tools used in the book reduce to the following: Green’s formula, Cauchy's 
inequality and elementary calculus and linear algebra. 


The problem sections play an important role in the presentation and the 
reader is urged to spend time to solve the problems. 

As a general refcrence giving # more detailed presentation of the material 
in Chapters 1 to 5, we refer to [Ci] (see also [SF]}. For variational methods 
for partial differential equations in mechanics and physics. see eg [DL]. {ET] 
and [Ne]. 


1. Introduction to FEM for elliptic 
problems 


In this chapter we introduce FEM for some elliptic model problems and study 
the basic properties of the method. We first consider a simple one-dimensional 
problem and then some two-dimensional generalizations. 


1.1 Variational formulation of a one-dimensional 
model problem 
Let us consider the two-point boundary value problem 


—u’(x)=f(x) for 0<x<], 
(°) — wo)=u(2)=0, 


where v" =< and f is a given continuous function. By integrating the 
x 


equation —u”=f twice, it is easy to see that this problem has a unique solution 
u. We recall that the boundary value problem (D) can be viewed as modelling, 
in particular, the following situations in continuum mechanics: 


A An elastic bar 


Consider an elastic bar fixed at both ends subject to a tangential load of 
intensity f(x) (see Fig 1.1). Let a(x) and u(x) be the traction and tangential 
displacement at x, respectively, under the load f. Under the assumption of 
small displacements and a linearly clastic material, we have in the interval 
(0,1) 


o =Eu’ (Hooke’s law), 
-v =f (equilibrium equation), 
u(0)=u(1)=0 (boundary conditions), 
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where E is the modulus of elasticity. If we take here E=1 and eliminate o, 
we obtain (22). 


yy x £ 


u(x) 
Fig tf 


B An elastic cord 


Consider an clastic cord with tension 1, fixed at both ends and subject to 
transversal load of intensity f (see Fig 1.2). Assuming again small displace- 
ments, we have that the Wansversal displacement u satisfies (D) (cf Problem 
1.2). 


Fig 1.2 


C Heat conduction 


Let u be the temperature and q the heat flow in a heat conducting bar, subject 
to a distributed heat source of intensity f. Assuming the temperature to be 
zero at the end points, we have in the stationary case 


-—q =ku’ (Fourier’s law). 
qi =f (conservation of cnergy), 
u(0) =u(1)=0, 


where k is the heat conductivity, which again gives (D) if k=1. 


We shall now show that the solution u of the boundary value problem or 
differential equation (D) also is the solution of a minimization problem (M) 
and a variational problem (V). To formulate the problems (M) and (V) we 
introduce the notation 
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fowl v(x)w(x)dx, 
0 


for real-valued piecewise continuous bounded functions. We also introduce 
the linear space 
’ 


V={v: v is a continuous function on [0,1], v’ is piecewise 
continuous and bounded on [0,1], and v(0)=v(1)=0}, 


and the linear functional F: VR given by 
Li ye 
FO)=5 Ws) ¥), 


The problems (Mf) and (VY) are the following: 
(M) Find ueV such that F(u)=F(v) VWev, 
(VY) Find ueV such that (u’, v)=(f.v) WveV. 


Let us notice that in the context of the problems A and B above, the quantity 
F(v) represents the total potential energy associated with the displacement 


veV. The term : (v', v') represents the internal clastic cnergy and (f.v) the 


load potential. Thus, the minimization problem (M) corresponds to the 
fundamental Principle of minimum potential energy in mechanics. Further the 
variational problem (V) corresponds to the Principle of virtual work. 

Let us now first show that the solution u of (D) also is a solution of (V). 
To see this we multiply the equation —u’=f by an arbitrary functiun veV, 
a so-called test funtion v, and integrate over the interval (0, 1) which gives 

-(u", D=€, v). 


We now integrate the left-hand side by parts using the fact that v(0)=v(1)=0 
to get 
—(u", vy=—u'(Lv(1)+u'(U)v(0)+(u', v=’, v’), 


and we conclude that 
dd.) (w’, v=(f, v) Wey, 


which shows that u is a solution of (V). 

Next, we show that the problems (V) and (M) have the same solutions. 
Suppose then first that u is a solution to (V), let ve V and set w=v—u so that 
v=utw and weV. We have 


F()=F(utw)=5 (u' tw’, u'tw)-(f, utw) 


il 


NI 


(ww) -(f, ww’, w= (6, wed Ow, w=FCU), 


since by (1.1). (u', w')-(f, w)=0 and (w', w')20, which shows that u is a 
solution of (M4). On the other hand, if u is a solution of (M) then we have 
for any veV and real number ¢ 


F(u)<F(u+ev), 


since uteve¥. Thus, the differentiable function 
2 
a(e)=F(utev)=4 (u’, wytetu’, v+~ (v', v)=(F, u)elt. v9, 


has 4 minimum at e= and hence g’(0)=0. But 
BORO, v)-(E vy), 
and we sce that u is a solution of (V). 
Let us also show that a solution to (V) is uniquely determined, Suppose then 
that u; and uz are solutions of (V). ie, u1, uzéV and 
(uj, v)=(f, ¥) We, 
(uy, v')=(f,v) Wev. 


Subtracting these cquations and choosing v=uj—uze V, we get 
1 
SJ (uj-u3)? dx=0, 
0 


which shows that 
uj(x)-—u3(QQ)=(uy—ua)'(J=0 —-WxeE[0, 1]. 
It follows that (u;—uz2)(x) is constant on [0,1] which together with the 
boundary condition uj(0)=u2(0)=0 gives uj(x)=uUo(x), Wxe[O.1], and the 
uniqueness follows. 
To sum up, we have shown that if u is the solution to (D), then u is the 


solution to the equivalent problems (M) and (V) which we write symbolically 
as 


(PI) > (Ye). 


Let us finally also indicate how to sce that if u is the solution of (V) then u 
also satisties (D). Thus, we assume that ue¥ satisfics 


1 1 
Ju'v'dx—Jfvdx=0 vvev. 
0 Q 


If we now assume in addition that u” exists and is continuous, then we can 
integrale the first term by parts to get, using the fact that v(0)=v(1)=0, 
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1 
—J(u"+f)vdx=0 VWev. 
0 


But with the assumption that (u"+f) is continuous this relation can only hold 
if (cf Problem 1.1) 


{u"+ Hx) =0 0<x<l, 


and it follows that u is the solution of (D). 

Thus we have seen that if u is the solution of (V) and in addition satisfies 
a regularity assumpton (u” is continuous), then u is the solution of (D). It is 
now possible to show that if u is the solution of (V), then u in fact satisfies 
the desired regularity assumption and thus we have (¥)> (2) which shows 
that the three problems (42), (V) and (M) are equivalent (cf Section 1.5 
below). 


Problems 
11 Show that if w is continuous on [U, 1] and 


1 
Jwvdx=0 Wev, 
0 


then w(x)=0 for xe[0, 1]. 


1.2 Show that under suitable assumptions the problem B above can be 
given the formulation (1.1). 


1.2 FEM for the model problem with piecewise 
linear functions 


We shall now construct a finite-dimensional subspace V, of the space V 
defined above consisting of piecewise linear functions. ‘Yo this end let 
O=xo<x1...<<Xm<xm41=1, be a partition of the interval (0,1) into subinter- 
vals 1j=(xj-1, ¥}) of length hy=xj—x)-y, j=l... .. M41 and set h=max hj. 
The quantity h is then a measure of how fine the partition is. We now let V) 
be the set of functions v such that v is linear on each subinterval ]j, v is 
continuous on [0,1] and v(0)=v(1)=0 (ef Fig 1.3). 


Fig 1.3 Example of a function ve Vi, 


We observe that Vac V. As parameters to describe a function ve Vp we may 
choose the values nj=v(xj) at the node points xj, j=0,.  ., M41. Let us 
introduce the basis functions pjeVn, j=1.. ..,M, defined by 


1 if i=) 


jC) = (9 if iF], i, jal... M, 


ie, gj isthe continuous piecewise linear function that takes the value 1 at node 
point xj and the value 0 at other node points (see Fig 1.4). 
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Fig 1.4 The basis function g; 


A function veV), then has the representation 
M 
v(x}= Zninitx), xe[0,1], 
is 


where nj=v(x;), ie, each veV, can be written in a unique way as a linear 
combination of the basis functions gj. In particular it follows that Vp is a linear 
space of dimension M with basis tpi} iM. 

The finite element method for the boundary value problem (D) can now 
be formulated as follows: 


(My) Find uye Vy such that F(up)<F(v) VveVp. 


In the same way as above for the problems (MM) and (V), we see that (M;) 
is equivalent to the finite-dimensional variational problem (V,): Find u,€ Vn 
such that 


(1.2) Cun’, ¥')=(E, ¥) WevVp. 


Thus the finite element method for (D) can be formulated as (Vq) or 
equivalently (My). The problem (Vy) is usually referred to as Galerkin’s 
method and (My) as Ritz’ method. We observe that if t,é Vp satisfies (1.2), 
then in particular 


(1.3) Cun’, @=(f @) j=1,  ..M, 


and if these equations hold, then by taking linear combinations, we see that 
up satisfies (1.2). Since 


M 
un 0)= 3 Eipi(x), S=un(%), 
i= 
we can write (1.3) 
M : : 
(1.4) 28. H)=6 OD j=1,..-,M. 
which is a linear system of equations with M equations in M unknowns 
E;,..., Sm. In matrix form the linear system (1.4) can be wrillen as 
(1.5) AE=b, 
where A=(a,,) is the MXM matrix with elements a,=(g;', g,'), and where 
§=(),.. .. &m) and b=(b), . . .. by) with b=(f, m) are M-vectors: 
an. aim 6 by 
A=| 2] .g=[ - | b= 
am) . . aqM, Eu bm 


The matrix A is called the siiffness matrix and b the load vector, with 
terminolugy from early applications of FEM in structural mechanics. 

The elements a=(¢i', @;') in the stiffness matrix A can easily be computed: 
We first observe that (q;’, @y')=0 if |i-j|>1 since in this case for all xe[0,1] 
either p(x) or @pj(x) is equal to zero, Thus, the matrix A is tri-diagonal, ie, 
only the elements in the main diagonal and the two adjoining diagonals may 
be different from zero. We have for j=1,. .,M 


20 


<P daw Pan ds 
(Hj) = z dxt oe ene 
x1} aj 5 


and for j=2,. ..M, 


; , ay Al 1 
(9; B-0= (nis ga-f sa dx=- 
j 
Note also that the matrix A is symmetric and positive definite since 


M 
(gi, G7) =(0)', Gi’) and with vix)= = nypi(x), we have 
- 


M M we ' row 
z Mer, = CE ni’. EHP y=, v20, 
je is = 


i 


with equality only if v’=0, that is sinee v(U)=0 only if v=0, or nj=0 for j=l. 
...,M. We recall that a symmetric MXM matrix $=(5jj} is said to be positive 
definite if 


M M 
yn Sn=_ = nisiiny>O WneR™, n #0, 
ij=l 


where the dot denotes the scalar product in RM. We also recall that a 
symmetric matrix S is positive definite if and only if the eigenvalues of S are 
strictly positive. 

Since a positive definite matrix is non-singular it follows that the linear 
system (1.5) has a unique solution, We also note that A is sparse, ic, only 
a few elements of A are different from zero (A is tridigagonal). This very 
important property depends, as we have seen, on the fact that a basis function 
pj of Vj, is different from zero only on a few intervals and thus will interfere 
only with a few other basis functions. The fact that the basis functions may 
be chosen in this way is an important distinctive feature of the finite element 
method. F 

In the special case of a uniform partition with hy =h ves the system 
(1.5) takes the form 


1 2 -1 0 2 3 1) = |r 
hyi 2 -1, 0 : 
(1.6) OS re he 
ate eet 
: 2 i ge ae 
0. ---+-0 -1 2) [Em buy 
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After division by h this may be interpreted as a variant of a standard difference 
method for (D) where the elements of the right hand side bj/h are mean values 
of f over the intervals (x\-1, Xj+1) (cf Problem 1.4). 

To sum up, we have seen that the finite clement method (Vp) for (2) leads 
toa linear system of equations with a sparse, symmetric and positive definite 
stiffness matrix. 


Problems 

1.3 Construct a finite-dimensional subspace V) of V consisting of functions 
which are quadratic on each subinterval 1j of a partition of 1=(0, 1). 
How can one choose the parameters to describe such functions? Find 
the corresponding basis functions. Then formulate a finite element 
method for (PD) using the space Vp and write down the corresponding 
linear system of equations in case of a uniform partition. 


1.4 Formulate a difference method for (D) and compare with (1.6). 


1.5 Consider the boundary value problem 


(1.7) ae 0<x<1, 


u(0)=u'(0)=u(1)=u'(1)=0. 


Here u represents cg the deflection of a clamped beam subject to a 
transversal force with intensity f (see Fig 1.5) 


Fig 1.5 


{a) In mechanics this beam problem would naturally be formulated as 
follows: 


(1.8a) M=u’, 0<x<l, 
(8b) M"=f,  0<x<l, 
(1.8c) — u(0)=u'(0)=u(1)=u'(1)=0, 


What does here the quanity M represent and what is the mechanical 
interpretation of (1.8a-¢)? 
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(b) Show that the problem (1.7) can be given the following variational 
formulation: Find ue W such that 
(u’, v’)=(f.v) WveW, 
where W={v: v and v’ are continuous on [0,1], v" is piecewise 
continuous and v(0)=v‘(0)=v(1)=v'(1)=0}. 
(c) For I=[a, b] an interval, define 
P3(1)={v: v is a polynomial of degree<3 on I, ie. v has the form 
v(x)=ayx?-+agx?-+ ayx tag, x€l where ajeR}. 


Show that veP3(1) is uniquely determined by the values v(a), v'fa). 
v(b), v’‘(b). Find the corresponding basis functions (the basis function 
corresponding to the value v(a) is the cubic polynomial v such that 
v(a)=1, v’(a)=0, v(b)=v'(b)=0, etc). 

(d) Starting from (c) construct a finite-dimensional subspace W,, of W 
consisting of piecewise cubic functions. Specify suitable parameters to 
describe the tunctions in Wy and determine the corresponding basis 
functions. 

(c) Formulate a finite clement method for (1.7) based on the space Wh. 
Find the corresponding linear system of equations in the case of a 
uniform partition. Determine the solution in eg the case of two 
intervals and f constant. Compare with the exact solution. 


1.3 An error estimate for FEM for the model 
problem 


We shall now study the error u—u, where u is the solution of (D) and Un is 
the solution of the finite element problem (Vp), i€, un€ Vp and up satisfics 
(1.2). The proof is based on the following equation for the error: 


(1.9) ((u—up)’, v')=0 VveVh. 


This follows by recalling that (u’, v')=(f,v), VWveV, so that in particular since 
Vac V¥ 


(1.10) (u’, v’)=(f, v) Vvevy. 
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Subtracting (1.2) from (1.10) we obtain (1.9). 
We shall use the notation 


1 
Neell=(w, w) 2 (f w2dxyt2, 
0 
Il [| is the norm associated wi 
Cauchy’s inequality: 


(4.1) Iv, w)lstlvi] |Jwl]. 


We shall prove the followin, 
Sense Up is the best possible 


th the scalar product (.,.). We also recall 


g estimate for u- Up which shows that in acertain 
approximation to the exact solution u. 


Theorem 1.1. Foy any ve Vi we have 


(Nu uy)'(f<{(u-v) "I. 


Proof Let veVy be arbitrary and set w= 


’ Un~v. Th i 
with v replaced by w, we get, using Cauc : ela 


hy’s inequality also, 


Iu un)? ((u-uyy’, (uuy)') +((u—uy)', w! 
= Y. < un)’, w’) 
=((u-u,)’, (a—u $6) = (Caw, 7 
<|)(u-u,)’j) leew. ((u-uny’, (a—v)’) 


Dividing by HCu-up) ‘I we j 
obtain the statement i 
[l(a-un)'i[=0, then the theorem clearly holds). Q beeen cL 


u at the nodes Xj, ie, 


Gh(x=u(xj) =0,..., Mad, 


Fig 1.6 The interpolan tin, 
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{is cusy 10 sce (cf any basic course in numerical analysis or Problem 4,1 below) 
thal if Gneé Vy is chosen in this way, then for Osx-81, 


(1.12) u'(xJ-G,O)i< bh max fu'(y)}, 
Osysl 
(1.13) u(x)—an(a)/ SE max ju") 
8 osyes 


Using (1.12) and Theorem |.1 we now obtain the following estimate for the 
derivative of the error u—up: 


(14) l(u-up)y'IlS br amar fa"): 

Sy & 
Since (u-up)(U)=0 we obtain from (1.14) by integration the following 
estimate for the error itself (cf Problem 1.6): 


(1.15) u(x) —un(x)/ S$ h max ju’(y)| for O<x<1. 
UstyS1 


We observe that this latter estimate is less sharp than the estimate (1.13) for 
the interpolation error where we have a factor h?. With a more precise analysis 
it is possible to show that in fact also the finite element method gives a factor 
h? for the error u—uy (cf also Problem 1.19 below). 

Let us note that the quantity u’, representing a deformation or a torce in 
Examples A and B above, is usually of more (or at least no less) practical 
interest than the quantity u itsclf, representing in these cases a displacement. 
Thus the estimate (1.14) is of independent interest and not just a step on the 
way to an estimate of u—uUy. 

Let us also notice that to prove (1.14) we do not need to concretely construct 
Uy (which would require knowledge of the exact solution u); we only have to 
be able to give an estimate of the interpolation error, for instance of the form 
(1.12), (1.13), 

To sum up, by Theorem 1.1 we have the qualitative information that 
\|(u—up)'|{ is ‘tas small as possible’’ and by using also the interpolation estimate 
(1.12) we obtain the quantitative error estimate (1.14), which in particular 
shows that the error tends to zero as the maximum length of the subintervals 
I, tends to zero if u” is bounded on [0,1]. 


Problem 
1.6 Prove (1.15) using (1.14) and the boundary conditions 
u(0)=uy(0)=0. Hint: Use the relation 


(u~un)6n)= few) Gy 


together with Cauchy’s inequality. 
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1.4 FEM for the Poisson equation 


We will now consider the following boundary value problem for the Poisson 
equation: 


(1.16a)  — Aust in Q, 
(1.16b) u=0 on, 


where 2 is a bounded open domain in the plane R2=({x=(xj, x2}: xje RY with 
boundary [, f is a given function and as usual, 
2, 2 
Au= oa eu. 
8x; Oxy 
A number of problems in physics and mechanics are modelled by (1.16); u 
may represent for instance a temperature, an electro-magnetic potential or 
the displacement of an elastic membrane fixcd at the boundary under a 


transversal load of intensity f (see Fig 1.7 and compare also with problem B 
of Section 1.1). 


Fig 1.7 


Let us now before continuing recall a certain ¢sreen’s formula which will 


be of fundamental importance in what follows. Let us start from the divergence 
theorem (in two dimensions): 


J div A dx=J A-n ds, 
Q r 


where A=(Aj, Az) is a vector-valued function defined on Q, 
OAr , Az 
Ox, Oxz 


div A= 


and n=(n), nz) is the outward unit normal to . Here dx denotes the element 
of area in Ry and ds the clement of arc length along I. If we apply the 
divergence theorem to A=(vw, 0) and A=(0, vw), we find that 
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» OY dye fuwn, ds. i= 12. 
(1.17) fs wixs [v3 dx Ba s 


$2 O% Q: OM 
- 


i onwie, Wva| oe oy 
Denoting by Vv the gradient of v, Le, \ 


5 a , we get from 
8x," 9X2 
(1.17) the following Green’s formula: 


=f wy eae 
hee oF 3x; 8x2 OX2 
ew . ew 
=f OP ai dey ON vs} as-Is| <3 +a dx 
cul 9x Ox2 2 Ldxy 9x3 
ow 
={y  ds—J vAw dx, 
fy on i 
ie, r 
Ww 
. = fy o™ ds— Jv Aw dx, 
(1.18) ive Vw dx fy an J 


where 


Bn Om 8x2 


is the normal derivative, ie, the derivative in the outward normal direction 


to the boundary f. ; 
We shall now give a variational formulation of problem (1.16). We shall 


first show that if u satisfies (1.16), then u is the solution of the foliowing 
variational problem: Find ueV such that 

(1.19) a(u, =f, ¥) We, 

where Be de ou m 


a(u, ye Lye Mii 3x1 ax au 86 


(f. v= JSfvdx, 
Q 
A ov Ov ‘ . 
V={v: v is continuous on Q, ah and Ox are piecewise 
continuous on Q and v=0 on T}. 


In exactly the same way as in Section 1.1. we see that ue ¥ salisfies een 
and only if wis the solution of the following minimization problem: Find ue 
such that F(u)<F(v), YveV, where F(v) is the total potential energy 


Fivy=e atv, vy) 9). 
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To see that (1.19) follows from (1.16) we multiply (1.16a) with an arbitrary 
test function ve V and integrate over Q. According to Green’s formula (1.18) 
we then have 


(f, vyy=-fAu vdx=—f SU vdxtfVu- Vv dx=a(u, v), 
Q Qn rs) 


where the boundary imegral vanishes since v=0 on [. On the other hand, if 
ueV satisfies (1.19) and u is sufficiently regular, then we see as in Section 1.1 
that u also satisfies (1.16) (cf Problem 1.10). 

Let us now construct a finite-dimensional subspace Vp of V. For simplicity 
we shall assume that [is a polygonal curve, in which casc we say that £2 is 
a polygonal domain (if F is curved we may first approximate T with apolygonal 
curve, see Chapter 12). Let us now make a triangulation of Q, by subdividing 
Q into a set Th=K),. . ., Km of non-overlapping triangles Kj, 

Q= UK=KiUK2. UKm, 
KeT, 


such that no vertex of one triangle lies on the edge of another triangle (see 
Fig 1.8) 


We introduce the mesh parameter 
h=max diam (K), diam (K)=diameter of K=longest side of K. 
Ket, 
We now define Vj as follows: 


Va= iv: v is continuous on Q, vi is linear for KeTy, v=0 on T}. 
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ieti { ie. the tunction defined on K 
a denotes the restriction of v 10 K, ie, " ‘ 
het von K. The space Vp consists of all continuous a een 
ct ‘ i We tice thal Vac. As 
i d vanish on I. We no 
are linear on cach triangle K an aa ae 
i i Vn we choose the values i 
eters to describe a function VEVh 
ewes N;, i=l,....M, of Tr (ee Fig 1.8) but exclude the nodes on a 
ree since v=0 on T The corresponding basis functions @j€ Vh. 
° 1 
j=i,..., M. are then defined by (see Fig 1.9) 


_ ft ifisj 
@(N)=di= ie if ij i j=l... M 


Fig 1.9 The basis function ©; 


We see that the support of qj (the set of points x for which ee) an 
of the triangles with the common node Ny (the shaded area in Fig 1.9). 
{unetion veV_ now has the representation 


M 
v(x)= E ny), ny=v(Nj), for xeQUP 
\= 


We can now formulate the following finite element method for (1.16) 


: Fi such that 
starting from the variational formulation (1.19): Find une Vp such 


(1.20) a(un. Y=, vy) WreVh. 
Exactly as in Section 1.2 we see that (1.20) is equivalent to the linear system 
of equations 
(1.21) Ag=b, 
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where A=(ajj), the stiffness matrix, is an MXM matrix with elements 
ay=aCgi, i) and E=(5)). b=(bi) are M-vectors with clements &=un(N)), 
bi=(f, i). 

Clearly A is symmetric and as in Section 1.2 we see that A is positive definite 
and thus in particular non-singular so that (1.21) admits a unique solution E. 
Moreover, A is again sparse; we have thal a3j=0 unless Nj and Nj are nodes 
of the same triangle. 

In the same way as in Section 1.2 we realize that upeVp is the best 
approximation of the exact solution u in the sense that 


(1.22) || Vu- Vupl}s|[Vu- Vvil WevVn. 
where 


|Vvi[=alv.v)!? = (f| Vv/2dx) 
Q 


In particular we have 

(1.23) |[Vu-Vualls|| Vu- Vital), 

where ty is the interpolant of u, ie, dye Vp and 
Gy(Ni)=u(N,) i=1,...,M. 


In Chapter 4 we prove that if the triangles Ke'l), are not allowed to become 
tuo thin, then 


(1.24) \|Yu-Vaali<Ch. 


Here and below we denote by C a pusitive constant, possibly different at 
different occurences, that does not depend on the mesh parameter h. In the 
case (1.24) the constant C depends on the size of the second partial derivatives 
of u and the smallest angle of the triangles Ke Ty. Onc can also prove (see 
Section 4.7) that 


Hu-upll=(f(u-up)edx)!?s Ch? 
Q 
with a similar dependence of C. In particular these estimates show that if the 


exact solution u is sufficiently regular, then the error and the gradient of the 
error UU, tend to zero in the norm |{-|] as h tends to zero. 


Example 1.1, Let Q be a square with side length | and let Ty be the uniform 


triangulation of @ according to Fig 1.10 with the indicated enumeration of 
the nodes of Th. 
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Fig L.Jl 


In this case the linear system (1.21) reads as follows: 


rowN+l1 [4-1 0 -1 GO + + Of [Sif = [br]. 
-1 4 -1 0-1 0 . 


0-1 4-1 U-l + + , . 
-1 0-1 4-1 0-1 0 i 
(1.25) 0 Bh » 0-1 . 


Note that here the left-hand side of equation i is a linear combination of the 
values of up at the § nudes indicated in Fig 1.10 with coefficients given in Fig 
1.11. Dividing by h? we recognize this as the linear system obtained by 
applying the so-called 5-point difference method for (1.16) with the com- 
ponents ot the right-hand side being weighted averages of f around the nodes 
N, (cf Problem 1.7 below). O 

The elements aj=a(@;, @j) in the stiffness matrix A are usually in practice 
computed by summing the contributions from the different triangles: 


(1.26) agi, P= % an(T, i). 
KeT,, 
where 
aK(g,. m=[Vor Vgjdx. 
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We notice that ax(@j, @j)=0 unless both nodes Nj and Nj are vertices of K. 
Let N,, N, and Nx be the vertices of the triangle K. We call the 3x 3-matrix 


(1.27) ax(Ps, i) ax(G,. ©) aK( Pi, Px) 
ax( gj. 4) aK(Gj- Px) 
sym AK(Pks Px) 


the element stiffness matrix for K. The global stiffness matrix A may thus be 
computed by first computing the element stiffness matrices for each K eT), and 
then summing the contributions from each triangle according to (1.26). Ina 
corresponding way we compute the right-hand side b. This process of 
computing A and b by summation is called the assembly of A and b. 

To compute the elements in the stiffness matrix (1.27) we clearly work with 
the restrictions of the basis functions gj, gj and @ to the triangle K, Denoting 
these restrictions by yj. yj and py, we have that each y is a linear function 
on K that takes the value one at one vertex and vanishes at the other two 
vertices of K, We call 1p,, 1p) and yy, the basis functions on the triangle K, cf 
Fig 1.12. If wis a linear function on K, then w has the representation 


w(x)=w(Ni Pi) +w(Ni)Yi(x) +w(N, A(X), xEK. 


4 


x4 


Fig 1.12 The basis function wy, associated with K. i 


Problems 
1.7 Formulate a difference method for (1.16) in the case when Q is a square 
using the difference approximation 


U u(x, +h, x2)—2u(x,, x2)+u(x;—h, x2 
Y (x1, 12)~ (1 2) -2u( ls ! ) 
Ox4 h 
: ear Wu 5 
and a corresponding approximation for ae Compare with Example 
ine 2 
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1.8 Find the linear basis functions for the triangle K with vertices al 
(0, 0), (h, 0) and (0, h). Show that the corresponding element stiffness 
matrix (1.27) is given by 


1 

+ -= 

2 

al l 

2 2 
1 

3 0 


Using this result show that the linear system (1.25) of Example [.1 has 
the stated form. 


1.9 Find the element stiffness matrix (1.27) for a general triangle K in 
yerms of the coordinates ai=(aj, a3), i=1, 2, 3, of the vertices of K. 


1.10 Show that if ue¥V satisfies (1.19) and u is twice continuously differ- 
entiable, then u satisfies (1.16). 


1.11 Find the element stiffness matrix for the problem 
-uw’=f for 0<x<!, u(0)=u(l)=0, 


if we use piecewise quadratic functions according to Problem 1.3. Then 
determine the corresponding global stiffness matrix in the case of a 
uniform subdivision. Can you interpret the resulting equations as 
difference approximations of the equation —u"={? 


1.5 The Hilbert spaces L2(Q), H'(Q) and HQ) 


When giving variational formulations of boundary value problems for partial 
differential equations, it is from the mathematical point of view natural and 
very useful to work with function spaces V that are slightly larger (i¢ contain 
somewhat more functions) than the spaces of continuous functions with 
piecewise continuous derivatives used in the preceeding sections. It is also 
useful to endow the spaces V with various scalar products with the scalar 
product related to the boundary value problem. More precisely, V will be a 
Hilbert space, (see below). 
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Before introducing these Hilbert spaces let us recall a few simple concepts 
from linear algebra: If V is a linear space, then we say thal L is a dinear form 
on V if L: VOR, ie, L(v)eR for veV, and L is linear, ie. for all y, weV 
and B, @eR 


L(Bv + Ow)=BL(v) + 8L(w). 


Furthermore, we say that a(. ,.) is a bilinear form on VXV if a: VXV—>R, 
ie, a(v, w)eR for v, weV, and a is linear in each argument, ie, for all u, v, 
weV and B, 0ER we have 


a(u, By+Ow)=Ba(u, v)+@a(u, w), 
a(But Ov, w)=Ba{u, w)+éa(v, w). 
The bilinear form a(.,.) on VXV is said to be symmezsric if 
a(v,w)=a(w,v) Vv, wevV. 
A symmetric bilinear form a(...) on VxV is said to be a scalar product on 
Vif 
a(v, v)>0 Yvev, v#0. 
The norm ||- ||, associated with a scalar product a(.,.) is defined by 
iIvla=(av, v))'?, Wwe. 
Further, if <.,.> is a scalar product with corresponding norm |] - |}, then we 
have Cauchy’s inequality 
(1.28) |<v, w>}s|[v1) |hvll. 


We further recall that if V is a linear spacc with a scalar product with 
corresponding norm || - ||. then V is said to be a Hilbert space if V is complete, 
ie, ifevery Cauchy sequence with respect to |] : |] is convergent. We recall that 
a sequence vq, v2, v3,.. ., of elements v, in the space V with norm ||°|| is said 
to be a Cauchy sequence if for all e>0 there is a natural number N such that 
I|vi-vjll<e if i, j>N. Further, v, converges to v if ||v—vi|| > Oasis *. The 
reader unfamiliar with the concept of completeness may bypass this remark 
and think of a Hilbert space simply as a lincar space with a scalar product. 

We now introduce some Hilbert spaces that are natural to use for variational 
formulations of the boundary value problems we will consider. Let us start 
with the one-dimensional case. If I=(a, b) is an interval, we define the space 
of “square integrable functions” on I: 


L,(D={v: v is defined on and fv7dx<e}. 
{ 
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The space La(I) is a Hilbert space with the scalar product 


(v, w)=/vw dx, 
1 


and the corresponding norm (the La-norm): 


Hhvlinaay=(frrdey'= (vs yy, 


By Cauchy's inequality. 
Iv, MISIIMiLalivtiia, 


we see that (v. w) is well-defined, ic, the integral (v, w) exists, if v and 
weLa(I). 


Remark. To really appreciate the definition of L2(1) and realize that this 
space is complete requires some familiarity with the Lebesgue integral, In this 
book, however, it is sufficient to get an idea of L2(J1) by using the usual Rie- 
mann integral; from this point of view we may think of a “typical” function 
veL,(I) as a piecewise continuous function, possibly unbounded, such 
that fv7dx<e. O 

i 


Example 1.2 We have (bat the function v(x)=x78, xel=(0, 1) belongs to L2(T) 
: I 

FR<=. 0 

if 8 3 

We also introduce the space H!(1)=(v: v and v‘ belong to LA(1)}, and we equip 
this space with the scalar product 


(vawpntay= {ow ty wax. 


and the corresponding norm 

(ivlay= Se Pda)? 
The space H'{1) thus consists of the functions v defined on I which together 
with their first derivatives are square-integrable, ie, belong to L,(1). 


In the case of boundary value problems of the form —u"=f on [=(a, b) with 
boundary conditions u(a)=u(b)=O0, we shall use the space 


Hi()={veH!(): v(@)=v(b)=9} 
with the same scalar product and norm as for H}{1). 
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Our introductory boundary value problem 


-u"=f on I=(0, 1), 
u(0)=u(1)=0, 


can now be given the following variational formulation: 


(1.29) 


(30) Find ueH4(1) such that (i. v)=(f, vy) We HAD. 


with (.,.) as in Section 1.1. If we compare (1.30) with the formulation (V) 
in Section 1.1, we note that the space Hi) is larger than the space V used 
in the formulation (V). The space H§(1) is specially tailured for a variational 
formulation of (1.29) and is in fact the largest space for which a variational 
formulation of the form (1.30) is meaningful. From a mathematical point of 
view the “right” choice of function space is essential since this may make it 
easier to prove the existence of a solution to the continuous problem. From 
the finite element point of view the formulation (1.30) as opposed to (V) is 
of interest mainly because the basic error estimate for the finite element 
method is an estimate in the norm indicated by (1.30) (the H'(1)-norm). 
Further, using the standard notation La(1), H'(B, HAd) ete, we may give our 
boundary value problems variational formulations in a concise way, as will 
be seen below. 
Now let Q be a bounded domain R“, d=2 or 3, and define 


L2(2)={v: v is defined on Q and Jv’dx<e}, 
Q 


H'(Q)= (veL2(2): a €L{Q), i=l... d), 


and introduce the corresponding scalar products and norms 


(v, w= fyw dx, [Iie c= Cfv2dx) 2, 
Q 2 


(¥, WHI(Q)=J[vw+ Vv- Vwdx, 
Q 


Ilvlleecay= S144 1 vP]dx)2. 
We also define 
Hi(Q)={veH!(Q): v=0 on T}, 


where [is the boundary of Q and we equip HQ) with the same scalar product 
and norm as H!(a2). 
The boundary value problem 
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a on 

can now be given the following variational formulation: 

Y Find ucHg(Q) such that a(u,v)=(f.v) Wwe HY), 
or equivalently 

(M) Find ueH{(Q) such that F(u)<F(v) Wve H}(Q), 
where 


Fl)=atvov)—(6¥), 


a{u, ala: Vvdx, (oy)= [fv dx. 


Remark The formulation (V) is said to be a weak formulation of (D) and the 
solution of (V) is said to be a weak solution of (PD). If u is a weak solution 
of (D) then it is not immediately clear that u is also a classical solution of (D), 
since this requires u to be sufficiently regular so that Au is defined in aclassical 
sense. The advantage mathematically of the weak formulation (V) is that it 
is casy to prove the existence of a solution to (V), whereas it is relatively 
difficult to prove the existence of a classical salution to (1). ‘lo prove the 
existence of a classical solution of (D) one usually starts with the weak solution 
of (D) and shows, often with considerable effort, that in fact this solution is 
sufficiently regular to be also a classical solution. For more complicated, eg 
non-linear problems, it may be extremely difficult or practically impossible 
to prove the existence of classical solutions whereas existence of weak 
solutions may still be within reach. O 


Problems 
1.12) Let Q={xeR?:|x|<1}, Show that the function v(x)=!x|* belongs to 
H!(Q) if a>0. 


1.13. Prove Cauchy’s inequality (1.28). 


1.14 Consider the problem corresponding to (D) with an inhomogeneous 
boundary condition, ie, the problem 

-Au=f in Q, 

O31) u=u) on’, 


where f and uo are given. Show that this problem can be given the 
following equivalent variational formulations: 
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(V) Find ue V(up) such that a(uv)=(f,v) Wve HY), 
(M) Find we V(ug) such that F(u)sF(v) Wve V(ug), 


where 
V(ug)= (ve H1(Q): v=ug on TP}. 


Then formulate a finite element method for (1.31) and prove an error 
eslimiale. 


1.6 A geometric interpretation of FEM 


We shall now give an interpretation of the finite element method in geometric 
terms in the function space H}(Q2). We recall that two elements v and w in 
a linear space with scalar product <.,.> are said to be orthogonal if 
<v, wo=0. 

Let us for simplicity consider the following variant of our previous problem 
(1.16): 


—Autu=f inQ, 


(1.32) u=0 onT, 


(cf Problem 2.5 below). The corresponding variational problem reads: Find 
ueHA(s2) such that 

JVu- Vvdx+ fuv dx=(f, v) WveH}(Q), 

2 2 
or 
(1.33) <u, v>=(f, v) VWweH4(Q), 
using the notation 

<u, v>=[Vu- Vvtuy|dx. 

a 
Note that <.,.> is in fact the scalar product in the space H}(&2), 
Let Vj be a finite-dimensional subspace of Hy(@), eg the space of piecewise 


Jinear functions of Section 1.4, and consider the following finite element 
method for (1.32): Find u,e Vp such that 


(1.34) <un, v>=(f,v) WveVn. 
Since Vic Hy(Q) we may choose veV), in (1.33) and on substraction from 
(1.34), we obtain 
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(1.35) <u-up, v>=0 VWveVp, 


ie the error u—up is orthogonal to V, with respect to <. ,.>. We may also 
express this fact as follows: The finite element solution up is the projection 
with respect to <. ,.> of the exact solution u on Vp, ie, Uy is the element in 
Vp closest to u with respect to the H!(Q)-norm || -|[Ha). or in other words 


(1.36) fla-ually'qySile-viln'a) Wve Vn. 


This situation is symbolically illustrated in Fig 1.13 where H}(Q) is repre- 
sented by the whole plane while the straight line through the origin represents 
Va. 


Fig 1.13 


According to (1.36), up is the best approximation of the exact solution u, in 
the sense that for no other function ve Vj, is the error u—v smaller when 
measured in the H1(§2)-norm. We have seen that uy can be found by solving 
a linear system of equations with right hand side depending on the given 
function f. Thus, we can compute a best approximation up of u, without 
knowing u itself, knowing only that - Aut+u=f in Q and u=0 on I. This 
remarkable fact reflects the ellipticity of the boundary value problem (1.32). 


Problem 
1.15 Prove that (1.35) and (1.36) are equivalent (cf the proof of Theorem 
1.1). 
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1.7 A Neumann problem. Natural and essential 
boundary conditions 


We shalt now consider a problem with another type of boundary condition, 
namely the following Neumunn problem (D): 


(1.37a) —Autu=f in & 
au 

1.37 pieaet 

{(1.37b} an g onT, 


where again & is a bounded domain with boundary 1’ and a denotes the 
én 


outward normal derivative to T. The boundary condition is a Neumann 
condition while the boundary condition u=ug on T considered previously is 
said to be a Dirichlet condition. In mechanics or physics the Neumann 
condition (1.37b) corresponds to a given force or flow g on T. 

We can give the problem (1.37) the following variational formulation (V): 
Find ucH!(Q) such that 


(1.38) a(u, v)=(f, v)+<g, v> VveH!(82), 
where 
a(u, v)=[[Vu- Vv+tuv]dx, (f, v= fv dx, <g, v>=f pv ds. 
Q Q v 


This is equivalent to the following minimization formulation (M): Find 
veH!'(Q) such that F(u)<F(v), Wve H'(@), where 


Fw)=4 a(v, v)-(f, v)-—<g, v>. 


To see that (1.38) follows from (1.37) we multiply (1.37a) with the test function 
veH"(Q) and integrate over Q. According to Green’s formula (1.18), we then 


OU 
et, since —= 
get, si ee 5 gonT 


(f, v)=S(-—Autu)y dx=—f Bey, dx+fVu- Vvdx+Juy dx= 
2 Tr on Q ray 
=—<g, v>+ f[Vu- Vvtuvjdx=a(u, v)—<g, v>, 
a 


which proves (1.38), 
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Let us now also motivate why a solution uéH|( 2) of the variational problem 
(1.38) also should satisfy (1.37). Using Green’s formula again we find from 
(1.38) that if u is sufficiently regular, then 


(f, v-+<g. ve=a(u, v)=f 24 y det f(-Autuyy dx, 
rT dn Q 
so that, rearranging terms, 
(1.39) f(—Autu-fjv dx+ SOX py ds=0 We H'(2). 
Q ron 


Now, as (1.39) holds in particular for all v in Hd(Q) and tor these functions 
the boundary term vanishes, we conclude that (1.37a) holds. ic, 


—Autu-f=0 ind. 
Thus (1.39) is reduced to 


fGt-pyv ds=o veH!(Q). 

pon 

But varying now v over H!(Q), which means that v will vary freely on T, we 
finally get 


du 
oA g=0 onl, 
and (1.37b) follows. 

We note that the Neumann condition (1.37b) does not appear explicitly in 
the variational formulation (V); the solution u of (V) is only required to belong 
to H!'() and is not explicitly required to satisfy (1.37b). This boundary 
condition is instead implicitly contained in (1.38), by first varying v “inside” 
we obtain (1.37a) and then (1.37b) by varying v on the boundary T. Such 
a boundary condition, that does not have to be explicitly imposed in the 
variational formulation, is said to be a natural boundary condition. This is in 
contrast to a so-called essential boundary condition, like the Dirichlet 
condition u=0 on [ in cg (1.32), that has to be explicitly satisfied in a 
variational formulation of the form (1.33). 

Let us now formulate a finite element method for the Neumann problem 
(1.37). Let then Ty be a triangulation of Q as in Section 1.4 and define 


Vn={v: v is continuous on &. vx is linear VKeTh). 


As parameters to describe the functions in V;, we of course choose the values 
at the nodes, now including also the nodes on the boundary F. Note that the 
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functions in Vp are not required to satisfy any boundary condition and that 
VncH|(Q). By starting from (1.38) we now have the following finite element 
method for (1.37): Find u,éVp such that 


(1.40) a(uy, Y=HCE, v)+<g, v> VveVn. 
As in Section 1.4 we see thal this problem has a unique solution uy, that can 
be determined by sulving a symmetric, positive definite linear system of 
equations. We also have the following error estimate 

[Ju—unlla'coy lla vl l(a) WeVh, 
and hence as above 

||u—unllit¢a) Ch, 
if u is regular enough. The function up will satisfy the Neumann condition 


(1.37b) approximatly, ie, “ will be an approximation to g on I'(cf Problem 
1.16). 


Remark When formulating a difference method for (1.37) one meets severe. 
difficulties due to the boundary condition (1.37b) unless Q has a very simple 
shape such as a rectangle. On the other hand, in the finite element formula- 
tion, the same boundary condition does nut cause any complication. O 


Problems 
1.16 Show that the problem 


—u"=f on 1=(0, 1), 
u(0)=u'(1)=0, 
can be given the following variational formulation: Find ue V such that 
(u’, v')=(f,¥) VWvev, 
where V=(veH'{I): v(0)=0}. Formulate a finite element method for 
this problem using piecewise linear functions. Determine the corre- 
sponding linear system of equations in the case of a uniform partition 


and study in particular how the boundary condition u’(1)=0 is 
approximated by the method. 


1.17 Show that the problems (M) and (V) of this section are equivalent. 
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1.18 Let Q be a bounded domain in the plane and let the boundary T of 


Q be divided into two parts [, and F. Give a variational formulation 
of the following problem: 


Au=f in Q, 
u=up inh, 
du 

ou on To, 
3n & 2 


where f, uy and g are given functions. Then formulate a finite element 
method for this problem. Also give an interpretation of this problem 
in mechanics or physics. 


1.19 Consider the finite element method (1.2) for the model problem 
(1.29). Let Gie H{(1) satisfy 


(1.41) (v', Gi=v(x) We HAI). 
where x, is a given node, i=1,. . ., M. Prove that G; is given by 


(1-x))x for OSxSxj, 
xi(1—x) for xiSx<1. 


Gi(x)= 


Note that G; is the Green's function for (1.29) associated with a 
delta function &(x)) at node x; (G, satisfies —G,"= (xi) on TL, 
G,(0)=G;,(1)=0). Further, note that it so happens that GjeVp. Now, 
by choosing v=e=uU—uy in (1.41), show that 


e(x=(e’, G)=0, i=l... M- 


Thus, up is in fact exactly equal to u at the node points x,. This some- 
what surprising fact is a true one-dimensional effect due to the fact that 
the Grecn’s function Gje Vn, and does not exist in higher dimensions. 
The technique of working with a Green’s function in this way is 
however useful in proving tor instance pointwise error estimates 
(maximum norm estimates) in higher dimensions. 


1.8 Remarks on programming 


Let us briefly discuss some of the essential features of a typical computer 
program implementing a finite element method. To be concrete we consider 
the Neumann problem of the previous section. Thus, Ict Th={K} be a 
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triangulation of the domain QcR? with boundary F and let Vp be the 
corresponding space of continuous piecewise linear functions. Let Ni, 
i=1,..., M, denote the nodes of Ty and P1.- .. @m the natural base for Vy, 
ie, (Nj) =8ij. We want to find the solution EeR™ of the linear system of 
equations 


(1.42) AE=b, 
where A=(ay), b=(b1,. . ., by), 
aj= Zak b= = ok, 


igs Or 
Kel, Kel, 


(1.43) aya f( Ves: Vgj+ gigyldx, 
bK=ftgidx+ f gqjds. 
kK Kar 


The computer program is naturally divided into subroutines carrying out the 
following tasks: 

(a) Input of data f, y, Q and coefficients of the equation. 

(b) Construction and representation of the triangulation 1}. 

(c) Computation of the element stiffness matrices ak and element loads bX, 
(d) Assembly of the global stiffness matrix A and load vector b. 

{e) Solution of the system of equations AE=b. 

(f) Presentation of result. 


Let us now consider the steps (b)—(e) in more detail. 


(b) Construction and representation of the triangulation Ty 


A program for automatic riangulation of a given domain may be based on 
the idea of successive refinement of an initial coarse triangulation; for 
example, we may retine each triangle by connecting the midpoints of each side 
(see Fig 1.14). 


A triangulation process of this type leads to guasi-uniform meshes where the 
triangles have essentially the same size in all parts of Q. If the boundary of 
Q is curved, this technique has to be modified close to the boundary. 

As discussed below. it is often desirable to be able to construct triangu- 
lations where the size of the triangles varies considerably in different parts 
of Q. In fact one would need smaller triangles in regions where the exact 
solution varies quickly or where certain derivatives of the exact solution are 
large, see Fig 1.16 where the triangles gct smaller in the arca where the 
solution has a quick variation (cf Example 1.3). A possible refinement strategy 
is indicated in Fig 1.15. Here, different coarse grid triangles are refined 
differently. Notice also the dotted lines introduced to complete the triangu- 
lation in the transition zone between regions with clements of different size. 
Recently, methods which automatically refine triangulations where needed, 
so-called adaptive methods, have been introduced, cf Section 4.6 below. 


Fig 1.15 


To represent a given triangulation Ty, one may proceed as follows: Let Ni. 
i=1,...,M,and K,,n=1,. ., N be enumerations of the nodes and triangles 
of Th, respectively. Then Ty may be specified using the two arrays Z(2, M) 
and T(3, N), where Z(j, i), j=1, 2, are the coordinates of node Nj and 
TG. n), j=1, 2. 3, are the number of the vertices of triangle Ky. As an example 
let us consider the following triangulation where the numbers of the triangles 
are indicated by a circle: 
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In this case we have 


112 3 4 5 3 3 8 7 7 
T=13 4 4 8 8 8 6 7 7 6 10}. 
4 2 5 4 5 9 7 8 9 10 9 


If we want to use Gaussian elimination to solve the system of equations AE=b 
(see Chapter 6), it is important that the nodes are enumerated in a suitable 
way. For instance, if we intend to store the stitfness matrix A as a band matrix, 
then we want the band width of A to be (nearly) minimal. 

Writing a general program for triangulation, including refinement and node 
enumeration (if needed), is a complicated task that we will not comment on 
further. Let us just note that if the geometry of Q is simple and we are satisfied 
with a quasi-uniform triangulation, then it is rather easy to write a subrautine 
for triangulation in each individual case. 

We now assume that in some way we have obtained a triangulation T}, and 
that Th is represented by the arrays Z and T as above. 


(c) Computation of the element stiffness matrices 


The next step is to compute the element stiffness matrices with elements at 
given by (1.43). We know that ak #0 only if both N, and N; are nodes of K. 
Let now KyéTp. Then T(a, n), a=1, 2, 3, are the numbers of the vertices of 
Ky, and the x;-coordinates, i=1, 2, for these vertices are given by Z(i, T(a, 
n)), a=1, 2, 3. Knowing the vertices of K, we can now compute the element 
stiffness matrix AM=(alg), a, B=1, 2, 3, for clement Ky 


ade {(V vo - Vippt worrp]dx, 
where 1p is the linear function on Ky that takes the following values: 


_}f1 ita=p 
WalNTE, 0))= {; if o#8 o, B=1, 2,3. 


We can also compute 


ba=Jfpadx! J gWa.ds, a=, 2,3. 
k, TAK, 


Thus, what we necd is a subroutine that computes the clement stiffness matrix 
A()=(agg) and right hand side b(")=(bg) for a given triangle Ka. We then 
Joop over all elements K,, and store the result on a scratch file. 
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(d) Assembly of global stiffness matrix 


To assemble the global stiffness A=(a,) we just loop over all elements K, 
and successively add in the contributions from different K, as follows (here 
A(M, M) and b(M) are arrays where the matrix A and right hand side b will 
be stored): 


Set A(i, )=0, —b(i)=0, i, j=l... M. 


For n=1,....N, fetch A()=(ag,) and b™=(b2) from scratch fille 
and set 


A(T(a, 0), T(B, n))=A(T(a, 0), T(B, n)) +agg. 
b(T(a, n))=b(T(a, n)) +bz a, B=1, 2, 3. 


(e) Solution of the linear system AE=b 


To solve AE=b we may use various variants of Gaussian elimination or 
iterative methods. This is discussed in more detail in Chapters 6 and 7. 


Remark In practice we do not use an array A(M, M) tor the stiffness matrix 
A; since A is sparse this would not be economical and would require storage 
of a large number of zero elements. Instead A is stored eg as a band matrix 
if Gaussian elimination is to be used to solve AE=b. or if an iterative method 
is used, then only the nonzero elements of A are stored (sce Chapters 6 and 
7 below). 


Remark Ina certain variant of Gaussian elimination (the frontal method) the 
assembly and elimination is carried oul im parallel which may save storage (cf 
Section 6.5 below). O 


Remark Once the stiffness matrix A for the Neumann problem (1.37) has 
been determined, for which the functions in Vj, do not satisfy any boundary 
conditions, we may directly derive the systems of equations AE =b correspond- 
ing to other boundary conditions. If on a part Py of the boundary TP we 


replace the Neumann condition Stag with the Dirichlet boundary condition 
n 


u=ug on Fy, then we obtain the corresponding system A&=b by simply 
deleting the rows in A corresponding to the nodes on Ty and by entering the 
values of § given by the Dirichlet boundary condition. O 
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1.9 Remarks on finite element software 


Writing a finite element program for a general class of problems with general 
geometry and variable coefficients (cf Example 2.7 below) is very timc 
consuming and requires expert knowledge. Therefore, much effort may be 
saved by using, at least in part, existing software. There are several general 
purpose finite element codes available for academic or commercia] use. In 
particular let us mention the codes with which we have some experience, 
namely CLUB MODULEF based at INRIA in France [CM] which is an 
extensive general purpose library of finite element routines, FIDAP (Fluid 
Dynamics Analysis Package) by M.S. Engelman [Fi] for problems in fluid 
mechanics, the adaptive multigrid code for elliptic and parabolic problems 
PLTMG (Piecewise Linear Triangular Multi Grid) by R. Bank [Ba], the 
smaller LSD/FEM package by M. Bercovier [Be] and the MACFEM program 
for the Macintosh personal computer by O. Pironneau [Pi]. These codes have 
a modular structure, clear documentation, give access to the source code and 
thus are suitable for research, development and educational purposes. 


Problem 

1.20 Write a computer program implementing the ideas of Section 1.8. 
Assume first simple geometry, eg 9 a square, and a uniform 
triangulation. Use a standard routine to solve AE=b with Gaussian 
elimination and A stored as a band matrix. 


Example {.3 Consider the Poisson equation (1.16) in a disc with radius 1 
centered at the origin and with the load f=—1 in a small disc with radius 0.25 
centered at (0.5, 0.5), and f equal to zero elsewhere. In Fig 1.16 we give the 
finite element mesh together with the level curves and the graph of the 
corresponding finite element solution obtained by applying a modification of 
the adaptive PI]. TMG-cade [Ba] to this problem, see [EJ2], [E]. PLTMG uses 
piecewise linears on triangles, and thus corresponds to (1.20). and also 
automatically refines the finite element mesh in order to control the error in 
a chosen norm. We notice that the elements are smaller in the area where the 
solution has a quick variation, cf Section 4.6 below. 

For more information on adaptive methods, see Section 4.6. Note that also 
the triangulation on the cover was generated by the modification of PLTMG 
applied to the Laplace equation with Dirichlet boundary conditions in a case 
where the exact solution has a singularity at the origin, and where accordingly 
the finite element mesh is refined. More precisely, in this case the exact 
solution is given in polar coordinates (r, 6) by 


u(r, @)=rsin(y0), yd, n 
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Fig 1.16 Solution graph (a), level curves (b} and triangulation {c) for finite element 
method for Dirichlet problem. 
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2. Abstract formulation of the 
finite element method for elliptic 
problems 


2.1 Introduction. The continuous problem 


We shall now give an abstract formulation of the finite element method for 
elliptic problems of the type that we have studied in Chapter 1. This is not 
a goal in itself, hut makes it possible to give a unified treatment of many 
problems in mechanics and physics so that we do not have to repeat in principle 
the same argument in different concrete cases. Further the abstract formu- 
lation is very easy to grasp and helps us to understand the basic structure of 
the finite element method. 

Thus, Jet V be a Hilbert space with scalar product (. , Jv and corresponding 
norm {|-||y (the V-norm). Suppose that (ef Section 1.5) a(.,.) is a bilincar 
form on VXV and L a linear form on V such that 


(i) (a.,.) is symmetric, 

(ii) af. ,.) is continuous, ie, there is a constant y>U such that 
(2.1) la(v. w)[SyilMiivilwlly Vy, we, 

(iii) a(.,.) is V-ellipuic, ie, there is a constant a>0 such that 
(2.2) aly, vyzally|% Vev. 

(iv) L is continuous, ie, there is a constant A>O such that 
(2.3) IL(v)<Allv|iv VWvev. 

Let us now consider the following abstract minimization problem (M): Find 
ueé¥ such that 


(24) F(u)=Min Flv), 


$0 


where 
F(v)=5 alv.v)-L(y), 


and consider also the following abstract variational problem (V): Find ueV 
such that 


(2.5) a(u,v)=L(¥) Vvev. 


Let us now first prove: 


Theorem 2.1 The problems (2.4) and (2.5) are equivalent, ie, ueV satisfies 
(2.4) if and only if u salisfies (2.5). Moreover. there exists a unique solution 
ueV of these problems and the following stability estimate holds 


(2.6) lulls &. 
a 


Proof Existence of a solution follows from the Lax-Milgram theorem which 
is variant of the Riesz’ representation theorem in Hilbert space theory (see 
eg [Ne], [Ci], cf also Theorem 13.1 below). The reader unfamiliar with these 
concepts may simply bypass this remark. To prove that (2.4) and (2.5) are 
equivalent, we argue exactly as in Section 1.1. We first show that if ueV 
satisfies (2.4), then also (2.5) holds, and we leave the proof of the reverse 
implication to the reader. Thus, let veV and e¢R be arbitrary. Then 
(u+ev)eV so that since u is a minimum, 


F(u)<F(u+ ev) VeeR. 
Using the notation g(e)=F(ut+ev), c€R, we thus have 
9(0)<a(e) VeeR, 


so that g has a minimum at ¢=0. Hence g’(0)=0 if the derivative g'(e) exists 
at e=0. But 


we)=5 a(utev, utev)—Ltutev) 


daca, w+ Ea(a. vt aly, wt “atv, Y—L(w-eLtv) 
2 a Qectee n apenwe nega 


NI 


2 
a(u, u)—L(u)+ea(u. v)-eL(v) + aly, v). 
where we used the symmetry of a(...). It follows that 
O=2'(O)=a(u, v)— L(y), 


5) 


which proves (2.5). To prove the stability result we choose v=u in (2.5) and 
use (2.2) and (2.3) to obtain 


allull<a(u, v)=L@)<Allully, 


which proves (2.6) upon division by ||ul|y#0. Finally, the uniqueness follows 
from the stability estimate (2.6) since if uy and uz are two solutions so that 
ujeV and 


a(uj,v)=L(v) WevV. i=], 2, 
then by subtraction we sec that uy—u2€V satisfies 
a(uj—u2,v)=0 Ve. 
Applying the stability estimate to this situation (with L=0, ie, A=0) we 
conclude that {[uj—uz/[y=0, ie, uj=u2. O 
Remark 2.1 Even without the symmetry condition (i) and with only (ii) —(iv) 
Satisfied, one can prove that there exisis a unique ueV such that 
a(u,v)=L(y) Vev, 


and the stability estimate (2.6) of course holds (cf Example 2.6 below). In this 
case there is however no associated minimization problem. O 


2.2 Discretization. An error estimate 


Now ler V; be a finite-dimensional subspace of V of dimension M. Let 
{1,.- .Qm} be a basis for Vp, so that @jeV_ and any ve V;, has the unique 
representation 


M 
(2.7) v= XI nip, where nieR. 
i-1 


We can now formulate the following discrete analogues of the problems (M4) 
and (V): Find une Vp, such that 


(2.8) F(uy)SF(v) WeVh, 
or equivalently: Find u,é Vp such that 
(2.9) a(un, v)=L{v) Wevn. 
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As in Section 1.2 we see that (2.9) is equivalent to 


a(un, 9) =L(@)), j=1,.. .. M. 


Using the representation 
M 

(2.10) ua= » Epi BeR, 
is 

(2.9) can be written as 


M 
2 APG )EALO)s I=L. My 


Or, in matrix form, 
(2.11) AE=b, 


where §=(Gi)eR™, b=(b)eR™ with bi=L(qj), and A=(ajj) is an MXM 
matrix with elements aj=a(¢,,9)). From the representation (2.7), we have 


M M M 
a(v.v)=al 2m 2 P= Z malMpnj=n - An, 


1, j=1 
M M 

L(v) =L( 2 O= Z mL(Gi)=b-n, 
where the dot denotes the usual scalar product in R™; 

7 M 

n= = Eni. 

i=l 

it follows that (2.8) may be formulated as 

i ‘ 1 
(2.12) 5 5: A&S-b-E=Min [= n- An—b-y]. 

gS ASHb-E Min’ [31 4n-b nl 
We also have, recalling (2.2), 

a Anzalv,v)zallvi[}>0, 


if v#0, ie. if n#0. Since also a(Pi,qj)=a(@y,@,), this proves the following 
result. 


Theorem 2.2 The stiffness matrix A is symmetric and positive definite. 
We can now prove the following basic result where the equivalence follows 


as above 
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Theorem 2.3 There exists a unique solution §€R™ to the equivalent problems 
(2.11) and (2.12), ie, there exists a unique solution u,€V} to the equivalent 
problems (2.8) and (2.9). Further, the following stability estimate holds: 


: A 
(2.13) llnllvs 


Proof Since A is positive definite, A is non-singular, which proves existence 
and uniqueness. The stability estimate follows by choosing v=up In (2.9) which 
gives, using (2.2) and (2.3), 


a|lun|| ySa(un th) =L(uh)<Al|unlly, 
from which (2.13) follows upon division by |Jun|lv#0. 
Remark The stability estimate (2.13) for the finite element solution, which 
is an analogue of the stability estimate (2.6) fur the continuous problem, 
reflects a very important property of the finite element method. In a certain 


sense it can be viewed as the theoretical basis for the success of the 
method. U 


Let us now prove the following error estimate: 


Theorem 2.4 Let ueV be the solution of (2.5) and u,¢Vp that of (2.9) where 
VacV. Then 


Jlu—unl| vs z |ju-v|lv WveVa. 


Proof Since Vac V we have from (2.5) in particular 
a(u.w)=L(w) YwevVn, 

so that after subtracting (2.9). 

(2.14) a(u—up,w)=0 Wwe. 


For an arbitrary ve Vp, define w=un—v. Then we Vp. V=Un—W and by (2,2) 
and (2.14), we have 


alju—upl|%<a(up, u-un)=a(u—Up, U-Un)+a(u—up, W) 
=a(u—upj, U—Un+W)=a(U—u_, U-¥)Sy}}o—upllv|lu-vi|v, 


where the last inequality follows from (2.1). Dividing by ||u—un|lv we obtain 
the desired estimate. DO 
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From the abstract qualitative estimate of Theorem 2.4 we may obtain a 
quantitative estimate by choosing a suitable function veV), and estimating 
|[u-v|ly. Usually one then chooses v=a,u where wpueV), is a suitable 
interpolant of u (eg pu may be the piecewise linear interpolant dp of Section 
1.3). In Chapter 4 we give estimates for the interpolation error ||u—aully in 
a variety of situations. 


2.3 The energy norm 


By (2.1) and (2.2) it follows that we may introduce a new norm ||- ||, on V 
defined by 


liviis=a(v, v), veV. 


This norm is equivalent to the norm || -|ly. ie, there are positive constants ¢ 
and C such that 


(2.15) elv{lv<|[vlla<Cllv||yv Ve. 


More precisely, we may choose c= V aandC=V y. Thescalar product (. ..)a 
corresponding to || - ||, is given by 


(v,W),=a(v,w). 
The norm ||- (|, is referred to as the energy norm. The error equation (2.14) 
may now be written 

(u-up, ¥)ja=0 WevVn, 
from which follows as in Section 1.3 or by the proof of Theorem 2.4, that 
(2.16) u-usllasllu-vlle VeVi 


or equivalently thal up is the projection of u onto Vp with respect to the scalar 
product (.,.)a (cf Section 1.6). Clearly (2.16) shows that un is a best 
approximation of u in the energy norm. 


2.4 Some examples 


Let us now consider some concrete examples of the form (2.5). In Chapter 
5 further examples from mechanics and physics will be presented. Let Q be 
a bounded domain in R? or R3 with boundary F The coordinates in R? and 
R3 are denoted by x=(x), x2) and x=(x;. x2, x3). 
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Example 2.1 Let V=H'(Q), QeR?, 
a(v, wW)=f[Vv- Vw+vw]dx, 
Q 


Liv) =Jfivdx, 


where fe L2{Q) in which case (2.5) is a variational formulation of the Neumann 
problem (1.37) with g=9. Let us verify that the conditions (i)—(iv) above are 
satisfied. Clearly a(. ,.) is a symmetric bilinear form on Vx V and L is a linear 
form. Further, 


aly, V=livlhitay 
and by Cauchy's inequality 
a(v,w)s<a(v,v)"a(w,w)!?=||v|le(ayllLlH(Q), 
which proves (2.1) and (2.2) with a=y=1. Finally 


Losi fv dx|<|/f{!c,¢o)||¥] LQ), 


which proves (2.3) with A=(IflIL(@)- 


Example 2.2 Let V=H)(D), (I=0, 1), 
a(v, w)=Jviw'dx, L(v)— fv dx, 
i 1 
where feL2(1) is given, which corresponds to our introductory boundary value 


problem (1.30). To verify that (i)—(iv) are satisfied, we first note that a(. , .} 
is obviously symmetric and bilinear and L is linear and since 


la@.w)lsllv ell Laslett, 


we have that a(. , .) is continuous. The continuity of L follows as in Example 
2.1 and it thus remains to prove the V-ellipticity (2.2), ie, the inequality 


Q.17) Siv'Pdx2alfv2dx+ f(v'Ydx) VWeHA(l), 
I i] if 


for some positive constant a. We shall prove that 


(2.18) Jvtdxs f(v')2dx Wwe HAI), 
[ I 


from which (2.17) follows with a=}. Since v(0)=0 for veHg(l), we have 


vod=v(0) + Jv'(yay= Jv" (y)dy, 
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so that by Cauchy's inequality 
U i 1 1 
weal sfiv'ldys(f aye Yedy P= tv yey)" 


Squaring this inequality and then integrating over I we obtain (2.18). We notc 
that the inequality (2.18) does not hold for v(x)=1, in which case the teft hand 
side is 1 and the right hand side 0. Thus we need eg a boundary condition 
of the form v(0)=0 for (2.18) to hold in order to control the norm of the 
function v by the norm of the derivative v’, ic, we need a “fixed point” to 
start from. 

If we choose Vj to consist of piecewise linear functions on T as in Section 
1.2, we obtain in this case 

|[u—up|[r'aysCh, 


if wis smooth enough. O 


Example 2.3 Let V=H}(Q), QCR?, 
a(v,w)=f Vv» Vw dx, L(v)=Jivdx, 
Q Q 
where fe L2(Q), in which case (2.5) is a variational formulation of the Dirichlet 


problem (1.16) for the Poisson equation. We directly see that (i), (ii) and (iv) 
are satisficd in this case. Thus, only the V-cllipticity, ie, the inequahty 


(2.19) [| VvPdx=a(w,v)2allv||foy=at f 2+] 0 vi2)dx) 
Q gQ 


Tequires comment. To prove (2.19), it is sufficient to prove that there is a 
constant C such that 


(2.20) fvdx<Cf|VvPdx Wve H)(Q). 
Q Q 


since then (2.19) follows with oat. The proof of (2.20) is analogous to 


the proof of (2.18) (cf Problem 2.1 below). With the Vp of Section 1.4 we 
obtain the error estimate 


Ju-unlli'(o) Ch, 


if wis sufficiently smooth. C1 


béxample 2.4 Consider the following boundary value problem 


4 
(2.214) Cons for xeI=(0, 1), 
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(2.21b) u(0)=u'(0)=u(1)=u'(1)=0, 

where feLa(I) (cf Problem 1.5). We introduce the space 
H?(1)={veL,(I): v’, v’eL(D}, 

with norm 


Mls Clr P+ Pax? 


and the space 
Ha()={veH'(): ¥(0)=v'(0)=0(L)=v"(1}=0} 


with the same norm. The problem (2.21) can now be given the variational 
formulation: Find ueV such that 


a(u,v)=L(v) Yvev, 
where V=H3(2), 

atem)= [viw'dx, Uy)=fiv dx. 
We see that the conditions (i), (ii) and (iv) are satisfied. By (2.18) we have 
for ve Ho(1) 

Jvtdns f(v' Pans f(v' dx, 


since v(0)=v'(0)=0, which proves that 


Mitra 3 ("Pax =3 av, v), 
1 


and (iii) holds with a=}. a 
We now introduce some notation that will be used below. We define 
lal 
Dey= gitly ; 
Oxf" Ox52 


where here a=(a;, @2), (i is a non-negative natural number and lal|=a,+a2. 
Asan example, a partial derivative of order 2 can then be writlen as D®v with 
a=(2,0), a=(1, 1) or a=(0, 2), which are the a with |a|=2. We now define 
fork=1,2,..., 


H*(Q)={veL2(Q): D¢veL2(Q), |a|<k}, 
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with norm 


Ivl(ay=C 2 f/Devi2dxy "2. 
la|sk & 


Thus the space H*(£2) consists of all functions v on Q that, together with the 
partial derivatives D®v of order |a! at most k, belong to L2(@2). ‘lhe space 
HY(Q) is a Hilbert space with the indicated norm and corresponding scalar 
product. The spaces H*(Q2) are examples of so called Sobolev spaces named 
after the Russian mathematician S. L. Sobolev 1908-. ef [Ad]. 


Exumple 2.5 Let us now consider a fourth-order problem in a two-dimensio- 
nal domain 2, namely the biharmonic problem: 
(2.22a) Adu=f in &, 


(2.22b) u=t=0 on T, 


where & denotes differentiation in the outward normal direction to the 


boundary f°. This problem gives a formulation of the Stokes equations in fluid 
mechanics (cf Problem 5.3) and also models the displacement of a thin elastic 
plate, clamped at its boundary, under a transversal load (cf Problem 5.4). To 
give a variational formulation of (2.22). we introduce the space 
HU(Q)= (ve HQ): v=24=0 on 1). 
nN 
Now we multiply (2.22a) with vell}(Q) and integrate over Q. By Green’s 


formula as v=3=0 on Tl, we have 
n 


Jfvdx=f A Au v dx= 

QR Q 

= pe (Au)v ds- JV(Au)- Vu dx= 
r on Q 


=- fV(Au) Vvdx=— fAu ov ds+ fAuAvdx=f AuAv dx. 
Q ro ong a 


We are thus led to the following variational formulation of the biharmonic 
problem (2.22): Find ue¥V such that 


a(u, vJ=L(y) VWev, 
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where v=H?2(Q) and 


a(u. v)=fAu Av dx, L(v)= ftv dx. 
2 2 
Again we see directly that (i), (ii) and (iv) are satisfied in this case and the 
V-ellipticity (iii) can easily be proved using the hints of Problem 2.2 below. 


In Chapter 3 below we shall construct finite element spaces V,>cHA(Q). O 


Example 2.6 Consider the following problem in a domain QcR?: 


(2.23a)  —pAu+p, S448, 58 yy | 
ad Bi 3x, +B2 Bx) +u-F in Q, 
(2.23) u=0onT, 


where yz and the 8; are constants with > 0. This is an example of a stationary 
convection-diffusion problem; the Laplace term corresponds to diffusion with 
diffusion coefficicnt 4 and the first order detivatives correspond to convection 
in the direction 8=(f,, Bz). Let us here assume that #1 and that the size 
of |B| is moderate (for convection-diffusion problems with |Bl/e large, see 
Chapter 9). By multiplying (2.23a) by a test function veV=HI(Q) integrating 
over 2 and using Green's formula for the Laplace-term as usual. we are led 
to the following variational formulation of (2.23): Find ueV such that 


(2.24) atu. v)=L(v)  Wev, 


where 


av, w= f(Vv- Vw (6, 2% 48, 5¥ = 
K w+(Br a +By ay x, Lv)= ftv dx. 
It is clear that af. ,.) is V-elliptic since if ve V, we have by Green’s formula: 


3 at 
SB vt Pe Buy DOYS >2(Bue + Banas 


cS 
~ Sev By) sy By axe pep, 8% v4.3, SY 
Q 8x1 Bo Ba) ‘ {Gs Bx, aque 


ie, 
ov av 
the = 
SB oF wre dx=0, 
so that 
a(v, Y= {1M vP+v'}ds= Iv lic, 
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Existence of a unique weak solution of (2.23) now follows from Kemark 2.1. 
Starting from (2.24) we may formulate the following finite clement method 
for (2.23): Find upe Vp such that 


(2.25) a(uy, VJ=L(v) WeVh, 


where Vy, is a finite-dimensional subspace of V. If {g1.. . ., gM} is a basis 
for Vp we have as above that (2.25) is equivalent to the linear sysiem AE=b 
where A=(ajj). aj=a( 9). and b=(bj). b;=(f,@i). Note that in this case the 
matrix A is not symmetric. 

By the V-ellipticity it follows that solutions of (2.25) are unique and thus 
A is non-singular so that AE=6 admits a unique solution, ie. there exists a 
unique solution uy of (2.25). By the same argument as in the proof ot bheorem 
2.3, we also have the error estimate (here ¢=1)}: 


|lu-unllHeysyllu-vilniay VveVy. O 


Example 2.7 Let u be the temperature in a heat conducting body occupying 
the domain QcCR3. We have in the stationary case the following relations: 


(2.268) y= ki(xy Sh in Q, i=1, 2,3. (Fourier’s law), 
6xj 


(2.26b) div y=t in 92 (conservation of energy). 


where the q, denotes the heat flow in the x-direction, kj(x) is the heat 
conductivity at x in the x-direction and f(x) is the heat production at x. If 
kiQ)=1, xeQ.i=1, 2.3, ie, if the heat conductivity is constant and equal in 
all directions, then eliminating q in (2.26), we obtain Poisson’s equation 
—Au=f in Q, With the kj non-constant, (2.26) is an example of a partial 
differential equation with variable coefficients. However, the coefficients k; 
arc not assumed to depend on the solution u. If this was the case and the heat 
conductivities k, depended on the temperature u. then (2.26) would be an 
example of a non-linear partial differential equation, see Chapter [3 below. 

Let us now give a variational formulation of (2.26) which in the usual way 
can be used to formulate a finite element method for (2.26). This shows that 
the presence of the variable coefficients k, do not introduce any difficulties. 
We complement (2.26a, b) with the following boundary conditions: 


(2.260) u=0 onTy, 


(2.26d) —q-:n=g on T3. 


61 


where [=I UL? is a partition of the boundary [ and n denotes the outward 
unit normal to [. The condition (2.26d) corresponds to a situation where the 
heat flow is given on ['2. 

We introduce the space 


V=(veH!(Q): v=0 on Ty}, 
multiply (2.26b) by ve V and integrate over Q. By Green's formula we then 
get 

Jfv dx=Jv div q dx=fvq-n ds— fq: Vv dx= 

Q Q i Q 


du dv 
=f2h_, k(x) <4 = ax-f gv ds, 
f jet Ki(x) ax, Ox, fe 
where the last equality follows from (2.26a), (2.26d) and the fact that v=0 
onl). Thus we are led to the following variational formulation of (2.26): Find 


ue¥ such that 
(2.27) a(u.v)=L(v) VWev, 


where 


‘ av ow 
=] Dk(x) — — dx. 
agree) i2i i) oxi Ox, 


L(v)=Jfv dx+ fgv ds. 
Q ry 


We easily verify that the conditions (i)—(iv) are satisfied under the following 
hypothesis: There are positive constants c and C such that 


e<kj(x) SC, xeQ, f= 1, 2, 3, 


feLp(2), geLo(T2), and the area of Ty ts positive. 

Starting from (2.27) we may now formulate a finite element method for 
(2.26) by replacing V by a finite element space VycV. This leads to a linear 
system AE=b with stiffness matrix A= (ay) with elements aij= a(qi.cpj) where 
{1,. ., Om} is a basis for Vp. To find the aj we have to compute integrals 
involving the variable coefficients kj(x). In practice we may fur this purpose 
want to use numerical quadrature, cf Chapter 12, G 


Problems 
2.1 Let Q be a square with side 1. Show that 
(fvdx)!@<(f| VvPdx)'2 We Hi(Q). 
Q a 
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2.2 


23 


2.4 


2.5 


Let Q be a square with boundary T Show that there is a constant C 
such that 


Iv ay SC f(A wax Wve H?(Q), 
Q 


by using the houndary conditions v=2¥ 20 on F and the fact that by 
Green's formula, m 
av Bey 


32v 5 
dx= f dx WveH}(Q). 
baa? Sam ax3 velo?) 


Note that if v=0 on T, then also Bog on I, where Fc 
3s ds 


differentiation in a tangental direction to [. 

Give a variational formulation of the problem 
dtu 
dx‘ 
u(0)=u"(0)=u' (=u (1) =0, 


f for 0<x<1, 


and show that the conditions (i)—(iv) are satisfied. Which boundary 
conditions are essential and which are natural? What is the interpre- 
tation of the boundary conditions if u represents the deflection of an 
elastic beam? 


Let S¢ be a square with boundary [. Show that there is a constant C 
such that 


({v2ds)!2<C|v|ln'(ay ¥vell'(Q). 
ie 


Using this result show that the linear functional L:H'(Q)—> R defined 
by 


Li)=J ev ds 
r 
is continuous if geL2(L), ic. if J gds<eo. 
r 


Give a variational formulation of the inhomogeneous Neumann 
problem 


—Autuef in Q, 
ou_ 
3078 onT, 
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and check if the conditions (1)—(iv) of Section 2.1 are satisfied. Give 
an example of a problem in mechanics that takes this form. 


2.6 Give a variational formulation of the problem 


—Au=f in Q, 
du 
+= oT, 
oS on Y . 


where y is a constant. When are conditions (i) —(iv) satisfied? Give an 
interpretation of the boundary condition (which is sometimes referred 
to as a Robin (or third type) boundary condition). 


2.7. Consider the variational problem (2.27) with variable coefficients. 
Suppose that Q is composed of two parts $2; and 622 with common 
boundary S (see Fig 2.1) and suppose the coefficients kj(x) are defined 
by 

% for xeQy, 


KO= %2 for xEQz2, 


where the x; are positive constants. 


Fig 2.1 


In this case (2.27) models stationary heat conduction in an isotropic 
body composed of two materials with heat conductivity coefficients x, 
and 2 occupying the regions Q) and 22. Show (formally) that ueV 
satisfies (2.27) if and only if 


(2.28a) -njAu=f in Q), j=1, 2, 
(2.28b) u=0 onT), 
(2.28¢) q:n=g on To, 
(2.284) mM Sia ons, 

an én 


Suj ee F Boned 
where —+ denotes the derivative of uj=ulg, in a direction n normal 
to S. 
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2.8 


2.9 


Notice that (2.28d) represents a balance of heat flowing between 2, 
and 29. Observe that this relation is “automatically built in” in the 
variational formulation (2.27). 


Show (formally) that u is the solution of the variational problem 
ate], 
2.29) Min [5 s")2dx—f v dx 
(2.29) Min Gb FRON(v dx ie dx], 
where [=(0, 1), and 
1 ifxeli=(, d, 
k(x)= 


B aeeeee 
5 if xeh=(5, 1), 


if and only if u satisfies 


~k(x)u"(x)=1 in T, and ly, 


(2.30) ujeuy 2s te for gk 
dx dx 2 


u(0)=u(1)=0. 


where ui=ul),, i=1. 2. Then formulate a finite clement method for 
(2.30) using piecewise linear functions. Determine the corresponding 
linear system in the case of a uniform partition and give an Interpre- 
tation of this system as a difference method for (2.30). 


Show that if u is the solution of the Dirichlet problem 


~Au=f in Q. 
u 


2:31 
( ) =0 on F, 


where feLo(Q) and QcR%, then p=Vu is the solution of the 
minimization problem 


2 Slr 5 
2.32) Min + flg/? 
(2.32) Min 5 flal dx, 


where 
Hr=({qeH: div q+f=0 in Q}, 
H={q=(q1, G2): uieLo(2)}. 


The minimization problem (2.32) corresponds to the Principle of 
minimum complementary energy in mechanics. Starting from (2.32), 
replacing Hr by a finite-dimensional subspace, onc may construct finite 
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2.10 


2.11 
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element methods of so-called equilibritan type (for such a method the 
equilibrium condition div q+f=0 will be satisfied exactly in the discrete 
model). Methods of this type may in certain cases have advantages as 
compared to the conventional finite element methods, so-called 
displacement methods, that we have studied above (in a displacement 
method for (2.26) the compatibility relation (2.26a) is satisfied 
exactly). Hint: First show that pe Hris a solution of (2.32) if and only 
it 

J p-qdx=0 WqeHo, 

Q 


where Hy={qeH, div q=0 in 9}. 

Solve Problem 2.3 with the following alternative boundary conditions: 
u(0)=—-u"(0) + yu'(0)=0, u(1)=u"(L)+ yu") =0, 

where ¥ is a positive constant. Also give a mechanical interpretation 

of the boundary conditions. 

Consider the Neumann problem 

(2.33a) —Au=fin Q, 


(2.33b) 2 =gonr, 
on 


(2.33) fudx=0. 
Q 


Note that if u satisfies (2.33a, b), then so does u+c for any constant 
c, and that the condition (2.33c) is added to give uniqueness. Give a 
variational tormulation of (2.33) using the space 


V=(veH!(92): J vdx=0}, 
Q 


and prove that the conditions (i)~(iv) are satisfied. 


3. Some finite element spaces 


3.1 Introduction. Regularity requirements 


We shall now present some commonly used finite element spaces V},. These 
spaces will consist of piecewise polynomial tunctions on subdivisions or 
‘triangulations’ Ty={K} of a bounded domain QcR4, d=1, 2. 3. into 
elements K. For d=1, the elements K will be intervals, for d=2, triangles or 
quadrilaterals and for d=3 tetrahedrons for instanec. 

We will need to satisfy either Vic H'(Q) or Vic H2(Q). corresponding to 
second order or fourth order boundary value problems, respectively, Since 
the space Vp consists of piccewisc polynomials, we have 


(3.1) Vic HQ) 6 Vac CB). 
(3.2) Vic H2(Q)2 VacC'(Q). 


where Q=QCTM and 
€(Q)={v:v is a continuous function defined on Q}, 
C'Q)= (veC(Q): DvEC(Q). [al=1}. 


Thus, VacH!(Q) if and only if the functions veV, are continuous. and 
Vic H?(Q) if an only if the functions veVy ad their first derivatives are 
continuous. The equivalence (3.1) depends on the fact that the functions v 
in Vj, are polynomials on each element K so that if v is continuous across the 
common boundary of adjoining elements, then the first derivatives Dev, 
Jal=1. exist and are piecewise continuous so that vcH'(9). On the other 
hand, if v is not continuous acrass a certain inter-element boundary, ic 
v¢C(Q), then the derivatives D¢v, Ju|=1, do not eaist as functions in L2(Q) 
and thus v¢H!(Q) (if v is discontinuous across an element side S, then D@v, 
|aJ=1, would be a 6-function supported by $ which is not a square-integrable 
function). In a similar way we realize that (3.2) holds. 
To define a finite clement space V; we will have to specify: 


(a) the triangulation Th={K} of the domain Q, 


67 


(b) the nature of the functions v in Vj on each element K (eg linear, 
quadratic, cubic, etc), 


(c) the parameters to be used to describe the functions in Vn. 


3.2 Some examples of finite elements 


Let us now consider some examples. We first consider the case when 9 is a 
domain in the plane R2 with polygonal boundary [. Let Th={K} be a given 
triangulation of & according to Section 1.4 into triangles K. We shall use the 
following notation for r=0, 1, re 


P,(K) ={v:v is a polynomial of degree<r on K}. 


Thus, P)(K) is the space of linear functions defined on K, te, functions of the 
form 


v(x) =a90+ 410%1+401%2, EK, 
where the ajeR. We see that {qr Y2, 3}, where 
Wi(XE1, a(x) =X1. Y3O)= X2.- 
is a basis for P,(K), and that dim P\(K)=3, where dim W denotes the 


dimension of the linear space W. : ; 
Further, P2(K) is the space of quadratic functions on K, ic, functions of the 


form 
2 
v(x)=ago+ aqox1 ing nx2-+.a29x] +a11X1X2 + ag2%3, XEK, 


where the ajeéR. We see that {1, Xt. X25 xi, X)X2, x3} is a basis for P2(K) and 
that dim P2(K)=6. In general we have 


P(K)=(v : (x)= 2 ayxixd for xeK, where a,€R}, 
O<i+j<r 


and 
: +1) (142 
dim p(k 
Example 3.1 Let 7 
(3.3) Vaz {veC(Q): vikePi(K), VKeTh}, 
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ie, Vn is the space of continuous piecewise linear functions that we have met 
in Section 1.7. As parameters, or global degrees of freedom, to describe the 
functions in Vy, we choose 


(3.4) the values at the node points of Tp, 


(including the node points on I). Let us now convince ourselves that this is 
a legitimate choice and show that a function ve Vp, ts uniquely determined by 
the values (3.4). This is of course intuitively quite obvious but let us anyway 
carry out the argument in detail here, since it will be a model to be used in 
more complicated situations below. We then first notice that if KeTn is a 
triangle with vertices a’, i=l, 2, 3, then the degrees of freedom for K 
corresponding to (3.4), ie, the element degrees of freedom, are 


G.5) the values at the vertices a’, 1=1, 2, 3. 
‘To show that a function ve Vp is uniquely determined by the degrees of 


freedom (3.4) it is sufficient to show: 


Theorem 3.1 Let KeTy be a triangle with vertices ai=(ai, aj), 1=1.2.3.A 
function veP)(K) is uniquely determined by the degrees of freedom (3.5), ie. 
given the values @,, i= 1, 2, 3. there is a uniquely determined function veP\(K) 
such that 


(3.6) v(ai)=aj i=1, 2,3. 


Proof Since v(x)=c1x1 +¢2x2+¢3 for some constants cieR, (3.6) is equivalent 
to the linear system of equations 
G.7) caiteaites=aq,  i=1,2,3 


in the unknowns ¢;. This system has a unique solution tor given qj if and only 
if the determinant detB of the coefficient matrix 


al 1 
B= | a? aol 

; a | 
is different from zero. However by basic linear algebra 
(3.8) detB/2=area of K, 


and thus detB#0. Hence B is non-singular, which proves the desired result. 
Since this argument will be used below, we alsu give a somewhat different 
version of this proof. We notice first that 
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dim Py(K)=number of degrees of freedom (=3), 


ie, (3.7) has the same number of unknowns as equations. In this case it follows, 
again by basic linear algebra, that detB+0 if and only it solutions of (3.7) are 
unique, or in other words if the only solution of (3.7) with o,=0, i=1, 2, 3, 
is given by ¢j=0, i=1. 2. 3, or formally: 


(3.9) If veP\(K) and v(a')=0, i=1, 2, 3, then v=0. 


In fact it is easy to prove (3.9) directly without using (3.8), which shows that 
we do not have to be able to compute detB in order to prove that detB +0. 
As we shall see below. this latter method of proof makes it possible to easily 
prove analogues of Theorem 3.1 for higher order polynomials in which case 
a direct computation of the determinant of the corresponding coefficient 
matrix could be very complicated. O 
We can now determine the (nodal) basis functions for P)(K) associated with 
the degrees of freedom (3.5), ie, the functions 4;¢P{(K), i=1, 2, 3, such that 
(see Fig 3.1): 
F 1 if i=j to 
i)=8,= = 
d(aly=dy {5 if i4j i, j=l, 2, 3. 
A function v(x)éP)(K) then has the representation 
3 
(3.10) vO)= E v(ahi(x) xeK. 
i=l 


To determine the basis functions 4;, we have to solve the system of equations 
(3.7) for three special choices of right hand side, namely, (1, 0, 0), (0, 1, 0) 
and (0, 0. 1). 


LX ee 
2 5 
Z a 

A dy a 


3 


Fig 3.1 


The basis tunction A;, say, can also be determined as follows. Let 
dx) +dox2+d3=0, 

be the equation for the straight line through the vertices a2 and a3. Then 
Ai (x) =y(dixi + dox2+d3), 
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where the constant y is chosen so that A,(a')=1. In the same way we may 
determine A, and A3. If the triangle K has vertices at (1.0), (0, 1) and (0, 0), 
then 4)=x), Az=x2 and 43=1—x),—x2. The notation Ay, Az and A3 for the nodal 
basis functions for P;(K) will be kept below. 

Given the choice of global degrees of freedom in (3.4), it is natural to 
describe the space Vp given by (3.3) alternatively as 


G.11) Vn={v: vikePi(K), WKeTh, and v is continuous at the nodes}. 


We then view a function ve Vj as a piecewise linear function taking on certain 
values at the nodes of T,. Let us be careful and check that (3.11) defines the 
same space as (3.3) above. We need to check if a function vc Vp, according 
to (3.11) is continuous, ie, if veC°(Q). Clearly, it is sufficient to check that 
v is continuous across al! intcrelement sides. Thus, let K} and K> be two 
triangles in T, having the common side S with the end points Ny and Ng, say. 
Suppose now ve Vp according to (3.11) and let vi=v]k,EP1(K,), i= 1, 2, be the 
restrictions of v to the Kj. Then the function w=v,—v2 defined on § vanishes 
at the end points Nj and No and since w is linear on § it follows that in fact 
w vanishes on S. Hence, v is continuous across § and we obtain the desired 
conclusion that ve Co) 


Example 3.2 Let us now show how to construct a space Vj), using piecewise 
quadratic functions v, ie, vIKeP2(K). Let us first specify the clement degrees 
of freedom. Let KeT) be a triangle with vertices al, i= 1, 2, 3, and denote the 
midpoints of the sides of K by a'l, i<j, i, j=1, 2, 3, see Fig 3.2. 


23 


o 


Fig 3.2 a 


We shall prove 


7) 


Theorem 3.2 A function veP2(K) is uniquely determined by the following 
degrees of freedom: 


v(a'), i=1, 2, 3, 
G12) vai), de], J=1 2, 3 

Proof Since dim P2(K) is equal to the number of degrees of freedom (=6), 
it is (see the proof of Theorem 3.1) sufficient to prove that if veP2(K) and 


(3.13) v(aij=0, v(ai)=0, i<j, i, j=1,2,3, 


then v=0. To this end, consider the side a2a3_ Along this side the function 
v has a quadratic variation and v vanishes at the three distinct points a2, a3 
and a. Thus, (cf Problem 3.1) v vanishes identically on a2} which means (cf 
Problem 3.3) that we can “factor out” the function 4, and write 


v(x) =Ai(x)wi (x), xeK, 


where w)eP\(K) and Aj, i=1, 2. 3, are the basis functions for P)(K) according 
to Example 3.1. In the same way we see that v also vanishes along the side 
ala? which means that we may also factor out the function Ag, so that 


v(x)=Ai(x)A2(x)wo, —-xEK, 


where now wo has degree zero, ie, Wo=y=constant. If we now finally take 
x=a!?, we see that 


O=v(a!?)=yhi(al?yin(al’)=y 3-4, 


so that y=0 and hence v=U and the proot is complete. O 
A function veP2(K) has the representation 
3 3 
(3.14) v= = v(aljhi(2a- 1) +E v(ail)4aidj. 
iv} i..jod 


i<j 


To see this, by Theorem 3.2 it is sufficient to check that the night hand side, 
RH, and left hand side, LH. of (3.14) take the same values at the node points 
ai and aij, since the difference LH-RHeP2(K). From (3.14) it is clear what 
the nodal basis functions for P2(K) corresponding to the degrees of freedom 
(3.12) are: the basis function corresponding to a particular degree of freedom, 
the valuc at the vertex a! for instance, is of course the function peP2(K) such 
that w(a')=1 and wp vanishes at the other five points ai. ali (see Fig 3.3). 


Fig 3.3 Different basis functions for P2(K) 


Let us also show that if vjeP2(Kj), i= 1, 2, where Ky and K2 are two triangles 
with the common side S. and v; and vz take the same values at the end points 
and the mid point of S. then vy; and v2 agree on S. But this follows immediately 
from the fact that w=vy—v2 varies quadratically along S and w vanishes at 
three distinct points on § so that w=0 on S. 

Defining now 


Vn= {ve C(Q): vigePo(K), VKEThI, 


we have seen that the global degrees of freedom of the functions ve V}, can 
be chosen as tollows: 


(i) the values of v at the nodes of Tj, 
(ii) the values of v at the mid points of all the sides of the triangles 
in Th. 


The corresponding global basis functions have the following form: 


LAS > 


Fig 3.4 


Example 3.3 We now define a space V, using piecewise cubic functions. 1¢, 
functions v such that vixeP3(K), WKeTp. Let K be a triangle with vertices 
al, i=], 2, 3, and define (see Fig 3.5): 


ati=l Qaltai), i j= 1,203, i) 
amt (a'+a? ta’). 
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Fig 3.5 


We have 


Theorem 3.3 A function veP3(K) is uniquely determined by the following 
degrees of freedom: 


v(ai), vali), i, j=1, 2,3, tj, 


(3.15) v(a!23), 


Proof Since dim P3(K) is equal to the number of degrees of freedom (~10), 
it is sufficient to show that if veP3(K) and 


(3.16) v(a')=v(a') =v(a!23)=0, i, j=1, 2,3, 4], 


then v=0. Observe that if v has a cubic variation along the side a2a> then v=0 
on a2a?, In the same way it follows that v vanishes on the sides a!a? and a!a2 
and hence 


V(x) = yh x) ha(x)A3(X), 


where ¥ is a constant. If we now choose x=a"3_ we get from (3.16) 


3 
so that y=0 and thus v=0. O 


Now let v,eP3(Kj). i=1, 2, where Ky and K3 are two triangles with common 
side S and suppose that v; and v2 take the same values at the end points and 
the two points alli of §. Since v,—¥> varies cubically on § it follows that vj=v2 
on § (see Fig 3.6). 
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Fig 3.6 


We can now introduce the space 
Vaz (veC(&): vine Ps(K). VK eT}. 


with the following degrees of freedom: 


(i) the values of v at the nodes of Th. 
(ii) the values of v at the points a! on the sides of Th, 
(iii) the values of v at the center of gravity for all KeTp. 


Example 3.4 There is another way of choosing the degrees of treedom for 
P(K), where K is a triangle with vertices a! i= 1, 2, 3, and center of gravity 
al23, We have 


Theorem 3.4 A function veP3(K) is uniquely determined by the following 
degrees of freedom: 


v(a'), i=], 2, 3, 
ov : : 
AT. — (a'), =1, 2,3. j=1, 2, 
Gin Fan 
v(al?4), 


Proof Since again dim P3(K) is equal to the number of degrees of freedom, 
it suffices to prove that if veP3(K) and 


(3.18) v(ai)= (a')=v(a'28)=0, i=1, 2, 3. j=l, 2, 
x} 
then v=0. It follows from (3.18) that 
BV ary = OY (ats + &Y (a)q=0. i= 1, 2,3 
5,8) 8x; Ot 3g Oe =, 2,3, 
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where x is the derivative in a direction s=(s). $2). In particular we then have 
iS 
8V/42)_ 9¥ 7.3 
—(a4J= —(a*)=0, 
352? 35! ) 


where s js the direction from a? to a3. Together with the fact that v(a2) =v(a3) 
this shows that v vanishes along the side aa? since v varies as a cubic 
polynomial along this side. In the same way see that v vanishes on ala? and 
ala} and the argument is then completed as in the proof of Theorem 3.3. 0 


We further note that if v;eP3(K;), i= 1, 2, where K, and K2 are two triangles 
with the common side S with endpaints Nj=1, 2, and v, and v2 agree 1ogether 


with the first derivatives a (Nj) and oS (Nj). i, j=1, 2, then vj=vp on S, 
Xx Xy 
The corresponding finite clement space Vy>cC%(Q) is given by 
Vn={v: vikePs(K), WKeTp, and v and 


ove. i=]. 2. are continuous at the nodes}, 
8x; 
with the following degrees of freedom: 
(i) the values of v and =, i=, 2, at the nodes of Th, 
IX 
Gi the values of v at the center of gravity of each KeT,. O 


Example 3.5 Let us now consider a finite element space Vp, satisfying the 
condition V,cC'{Q). We will then work with functions that are polynomials 
of degree five on cach triangle; with polynomials of lower degree, special 
constructions are required to satisty the C!-condition. 


Theorem 3.5 Let K be a triangle with vertices a', i=1, 2, 3 and let a! be the 
midpoint on the side aial. i, j=1, 2, 3. i<j (see Fig 3.2). A function ve Ps(K) 
is uniquely determined by the following degrees of freedom: 


Dev(al), i=1, 2, 3, [al s2, 


G19) (ai), i, J=1, 2. 3.15), 


3 , uke oa Bee 
where an denotes differentiation in the outward normal direction lo the 
n 


boundary of K. 


Proof Since dim P5(K) is equal to the number of degrees of freedom (=21), 
it is sufficient as usual to prove that if all the degrees of freedom according 
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to (3.19) are zero, then v=0. To see this, we first note that if s denotes the 
direction of the side a’a3. then 


bv wae 
(3.20) v(al)= aaa) = Bette i=2, 3. 


Since v is a polynomial on the side aza3 of degree at most 5S, it follows that 


yv vanishes on a?a}. Further, x is a polynomial of degree at most 4 on aa? 
n 


and 


3 (9 a _ 
(3.21) 2 }= Maya 3 [Bl ay=o, i=2, 3, 


which is only possible if ov 9 on a2a3. Thus, both v and x vanish on a2a> 
on 


which means that we may factor (1(x))? out of v(x) (check this in the special 
case when a2a3 lies on the x2-axis). Therefore 


v(x) = (hi (8))?p3(X), xeK, 
where p3eP3(K).. In the same way we see thal we may also factor out Qi), 
i=2, 3, and thus 


v=yMiian3, 


where yeR. But vePs(K) and the only possibility then is that y=0 so that v=0 
on K. 


Now let ¢Ps(Kj), i= 1, 2, where K, and K2 are two triangles with common 
side S and suppose that 


D%y,=D4v. at the endpoints of S, |]<2, 


chilwicats at the midpoint of S, 

on on 

where 2 denotes differentiation in the normal direction to $. Then we have 
on 


the relations (3.20) and (3.21) for the difference w=vj—v2 and it follows that 
aw 
==—=0 on S. 
(3.22) oer 


But if w=0 on S we also have that 


(3.23) 3Y=0 on, 


iS 
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where 2 denotes differentiation in the direction tangential to S. By (3.22) 
s 


and (3.23) we see the function v defined by v|x =v; varies continuously across 
S as do its fitst derivatives. i 
We may now define the space Vyp>cC!(@) as follows 
Va={vi vigePs(K), VKETp, D&v is continuous at the nodes for 


ja|<2 and = is continuous at the mid points of each side}, 
mn 
with the degrees of freedom of (3.19). 


Example 3.6 Let us now construct a three-dimensional finite element. We 
then assume that 9 is the union of a collection Ty={K} of non-overlapping 
tetrahedrons K such that no vertex of one tetrahedron lies on a side of another 
tetrahedron. As above, forr=1, 2,..., and KeTh. we define 


P(K)=(v: vis a polynomial to degree <ron K, ie v has the form 
V(x)= LX aiimX]X3X9, diimeR}- 
itjtmer 
For r=1 a function veP)(K) is uniquely determined by the values v(a'), 
i=1,...,4, where the aiare the vertices of K. We can then introduce the space 


Van {veC(Q): viKePi(K), UKeT}, 


and as global degress of freedom we may take the values at the nodes of Tj, 
poins. O 


Example 3.7 Let us also consider some rectangular finite elements that can 
be used for example if QcR? is a square. Let then K be a rectangle with 
vertices a’, i=1,.. ., 4, and with sides parattel to the coordinate axis in R?. 
Define 

O.(K)={v: v is bilinear on K, ie, v(x)= apo t ayox) Fag x2 + aX 142, 

xeK, where the aycR}. 

It is easy to see (prove this!) that a function véQ(K) is uniquely determined 
by the values v(ai), i=1,..., 4. Further, if Ky and Kz are two rectangles with 
the common side § and the functions v,eQ)(Kj) agree at the endpoints of S 
then v;—v2=0 on S since v,—V2 varies linearly on S. We may now define 


Vaz (veOQ(Q): vine Qi(K), VKETh} 


assuming that Ty={K} is a subdivision of Q into non-overlapping rectangles 
such that no vertex of any rectangle lies on a side of another rectangle. The 
values at the nades may be used as global degrees of freedom. 
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We can also use polynomials of higher degree on each rectangle. For 
example we may choose 


Vn=(veC"(Q): vlKeQo(K), VKETa}. 


where Qa(K) is the set of biguadratic functions on K. ie, 


2 
Q2(K)={v: v(xj= = ajjx{xJ, xe€K, where the ajeR}, 
i. j=0 


and use as global degrees of freedom 


(i) the values at the nodes of Tp, 
(i) the values at the midpoints of the sides of Ty, 
(iii) the values at the midpoint of cach rectangle KET). 


Since the use of rectangular elements requires very special geometry of Q it 
is of interest to also consider more general quadrilateral elements. The 
simplest such element is presented in Problem 12.3 below in connection with 
so-called isoparametric finite elements. 


3.3 Summary 


We have not yet given a formal definition of what we mean by a “finite 
element” To fill this gap define a finite element to mean a triple (K, Px. 2). 
where 


K is a geometric object, for example a triangle, 
Px isa finite-dimensional linear space of functions defincd on K, 
X is a set of degrees of freedom, 


such that a function vé Px is uniquely determined by the degrees of freedom 
©. From Example 3.1 we have that (K, Pe. Z). where 


Kis a triangle, 
PR=Pi(K), 
= is the values at the vertices of K, 


is a finite clement. In Fig 3.7 below we have collected some of the most 
common finite elements (cf [Ci]). The various degrees of freedom are denoted 
as follows: 


Ww 


4 function values 

fo) values of the first derivatives, 
© — values of the second derivatives. 
/ value of the normal derivative. 


= 


2. 
7 value of the mixed derivative oy 4 
9X1 6X2 


Finally, Fig 3.8 indicates in the case of two dimensions the support of certain 
basis function véVy, ie, the points x such that v(x) 40. The different cases 
correspond to a value at a node. the midpoint of a side or a point in the interior 
of an element. Clearly the supportis always small and if @ and y arc two basis 
functions associated with the nodes N; and Np, then the supports of the 
functions @ and p overlap only if Nj and N> belong to the same element 


Degrees of freedom = Degree of continuity 
Geometry Function space Px of corresponding 
FEM-space Vp 
rae 19, Pa(K) ce 
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(See Problem 3.4) 


—— 2 P,(K) co 
—— 3 atk) co 
o_o 4 P3(K) fou 
Pa. se . 
Caw 18 Pek) co 
») (see Problem 3.7) 
4 Pik) o 
10 otk) © 


Fig 3.7 Some common finite elements. 


Jeg 


Fig 3.8 The support of different basis functions. 


8] 


Problems 


3.1 


3.2 


3,3 


fe 
b 


3.5 


3.6 


3.7 


3.8 


T , 
Show that if veP,(I)={v: v(x)= & ajx!, xel. where a;eR}, the set of 
i-0 


polynomials of degree at most r On the interval I, and if v vanishes at 
r+1 distinct points on I. then v=. Recal! that if veP,(1) and v(b)=0 
for some bel, then v(x)=(x—b}w(x) where weP,_) (1). 


Prove that if veP;(K) where K is a triangle, then veP,(S) for any side 
S of K. 


Let K be a triangle with vertices a', i=1, 2, 3. Suppose that ve P,(K) 
and that v vanishes on the side 2?a?. Prove that v has the form 


¥(X)=A1(x)wy-1(X), XEK, 
where w,-j€P,-)(K) and A, is detined in Example 3.1. 


Let K be a tetrahedron with vertices a, i=1,..., 4, and let a denote 
the midpoint on the straight line a'a), i<j. Show that a function 
veP2(K) is uniquely determined by the degrees ol freedom: v(a‘), 
v(aii), i, j=l... 4, i<j. Show that the corresponding finite element 
space Vp satisfies Va»cC%(Q). 


Determine the stiffness matrix corresponding to the Poisson equation 
(1.16) when Q is a square with side 1} and we usc the bilinear element 
1 


of Example 3.7 with ne 


Let K be a triangle with vertices a! and let ail, i<j. denote the midpoints 
of the sides of K. Show that a function véP)(K) is uniquely determined 
by the degrees of freedom v(ai), i<j. Consider the corresponding 
finite element space Vy. Is it true that Vy>cH!(Q)? Can we apply the 
theory of Chapter 2 in this case? 


Show that a function vePs(K)=(vePs(K): ois a polynomial of de- 


gree at most 3 on each side of K} is uniquely determined by the degrees 
of freedom Dv(a!), ja] $2, 3=1, 2, 3, where the ai are the vertices of 
the triangle K. 


Let K be the triangle of Problem 3.6 and let a'23 denote the center of 
gravity of K. Prove that veP4(K) is uniquely determined by the 
following degrees of freedom 


iy Ov 

v(a'), — (a! i= j= 

(@’) 3x; ( ), i=l 2.3. j=, 2, 
v(a), i, J=1, 2,3, i<j, v(a?23), 

Also show that the functions in the corres 

are continuous, 


ponding finite element V, 
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4. Approximation theory for 
FEM. Error estimates for 
elliptic problems 


4.1 Introduction 


For a typical elliptic problem satisfying the conditions (1) —(iv) of Section 2.1, 
we have by Theorem 2.4 


itu—ugllve¥ llu-vilv Wevn. 


Choosing v=n,ueV), to be a suitable interpolant of u and estimating the 
interpolation error ||u apul|y we obtain an estimate of the error ||u—up||y. 
In this chapter we study the problem of cstimating the interpolation error 
|Ju—stpul|y. The interpolant -mue Vj is usually chosen so that the degrecs of 
freedom for V), agree for u and mu. In this case the problem of estimating 
|lu-sthully is reduced to the problem of estimating u— yu individually oneach 
element KeT), 


4.2 Interpolation with piecewise linear functions in 
two dimensions 


We shall first consider the case where V=H!(Q) and V,={veV: vixePi(K), 
VKeTh} where T,={K} is a triangulation of QCR?, ie, Vp is the standard 
finite element space of piecewise linear functions on triangles K (cf Section 
1.7). For KeT; we define (see Fig 4.1) 


hx=the diameter of K=the longest side of K, 
Qx=the diameter of the circle inscribed in K, 
h= max hx. 
KeT, 
To be more precise, we will subsequently be concerned with not only one 
triangulation T,, but a family of triangulations {Ty} that are indexed by the 
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parameter h. We shall below assume that there is a positive constant § 
independent of the triangulation Tne{Th}, ie, independent of h, such that 
(4.1) &k 58 VK eT. 

hx 
This condition means that the triangles KET) are not allowed to be arbitrarily 
thin, or equivalently, the angles of the triangles K are not allowed to be 


arbitrarily small; the constant B isa measure of the smallest angle in any KeT), 
for any Tye (Ti). 


; Let Nj. i=l, .. ., M, be the nodes of Th. Given ueC(Q) we define the 
interpolant xpue Vy by 
ayu(N))=u(N;) i=], 1M. 


Thus mu is the piccewise linear function agreeing with u at the nodes of Th 


We will start by estimating the interpolation error uu On each triangic K. 
We have the following result. 


Theorem 41 LetKeT) bea triangle with vertices a',i=1, 2.3. GivenveC%(K) 
let the interpolant aveP)(K) be defined by 


(4.2) mv(a')=v(al), i=1, 2.3. 
Then 
(4.3) Ilva h¢xy 2h max ||D°y|h (x). 
al =2 
(4.4) max ||D¢(v— x hg q 
max [IDe(v—2v)Ile (Ky $6 Oz max [Dll 04). 
where 


Iv {Lag = max |v(x)]. 


Fig dd 


85 


Before giving a proof of Theorem 4.1 let us comment on the estimates (4.3) 
and (4.4), We note that the size of the errors v-av and D¢(y—av) depend 
on the second partial derivatives of v: the larger these derivatives are, the more 
“curved” is the surface representing the function v and thus the larger is the 
devialion v—srv from the plane representing av (see Fig 4.1). Also note that 
the assumption (4.1) will be used in the estimate (4.4) to bound the quantity 
hg/OK. 


Proof of Theorem 4.1 Let di. i=1, 2, 3, be the basis functions for P)(K) 
described in Example 3.1. A general function weP,(K) then has the 
representation 

3 

w(x)= = w(a')hi(x), xek, 

i=l 

sO thal in particular 
3 . 

(4.5) mv(x)= FZ V@QINCO, xeK, 

i 
since by (4.2) mv(a')=v(a'). We now derive representation formulas for the 


errors v—av and D¢(v— sv), |a|=1, using the following Taylor expansion at 
xEK: 


2 bv 
MOD=V)* ESE (9) Oya) RE y), 
=19Kj 


where 


2 2 
Ro Wah = 2% © Gi-K) Oi), 


i, jot OXON] 


is the remainder term of order 2 and & is a point on the line segment between 
x and y. In particular by choosing y=a', we have 


(4.6) v(ai)=v(x) +p,(x) + Ri(x), 
where 
i i as 
Pi(x)= Aa (aij). a=(aj, a3), 
Ri(x)=R(x, a’). 
Since 


jai~xjlShx, i=1.2,3. j=1, 2, 
J J 
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we have the following estimate of the remainder term Ri(x): 
(4.7) Ri(x)<2hg max [[Dovll..c%), i=1,2.3. 
al= 
Now (4.5) and (4.6) combine to give 
: 3 3 3 
(4.8) av@jev(x) & a(x) + E pi(x)Ri(2) + = Ri(x)ai(x) xeK, 
i= I= inl 
We now need the following lemma whose simple proof is given below, 
Lemma 4.1 For j=1, 2 and xeK we have 
3 
(4.9) Ehi(x)=1, 
i=1 
3 
(4.10) = pi(x) (a) =O, 


“A(x)= > Btn) <0. 


3 
(4.11) = 
i=) Ox; Oxji=1 


u oki av 
4.12 = pix) ioe & &y, 
(4.12) AP (x) Bx) (x) 2x) (x) 


By (4.9), (4.10) and (4.8) we have 
m(siav(a)+ 3 RiGOailx). 
which gives us the following representation of the interpolation error: 
v(x) T(x) = — 2 Ri a(x). 


Since OSA\(x)<1, if xEK, i=1, 2, 3, we can use the Previuus estimate (4.7) 
of the remainder term R; to get 


Ivey-RvQois F/R Dat) 


3 
max ROO Z A) =2hg max NDevIlecey. —-x€K, 
i i= ale 


which proves (4.3). 
To prove (4.4) we differentiate (4.5) with respect to x; to get 
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anv 
Oxy 


3 4s 
(= E via) 24 G0, 
i=l 8x1 


which together with (4.6) shows that 


3 3 Or 3 Ai 3 
(4.13) 2% axy=v(x) 5 Maye 5 pin Mee F Rie) 2M. 
Ox) i=lOxy i=l 8x, i=l 8x1 
Hence, by (4.11) and (4.12) we have 
amv(x) av 3 hi 
—= (x)= — (x) + E Rix) —(), 
Sn = FLO) EROS) 
which gives the following representation of the error 2 > aa. 
x1 Oxy 
av amv 5 3h, 
—(x)-—(J)=—- 2 R(x) —(4), xeK. 
mi ) SE ) re i Jeo ) 


It is now easy to see (cf Problem 4.2) that 


BAj 


I 
3x0) 


<< 


OK 
which together with (4.7) finally gives 


4.14 max 
( ) xeK 


2 
oN oa =< 6K max| |D°vI|L=(K)- 
Ok lal=2 


——(x)- (x 
8x1 Sa ) 
A dv anv 
In the same way we estimate — — — 
Qx2. OX? 
the theorem is now complete once the lemmu is established. 


and thus (4.4) follows. The proof of 


Proof of Lemma 4.1 The proof is based on the following observation: 
(4.15) av=v if veP\(K), 

which of course follows from the fact there is a unique function ve P)(K) 
assuming given values at the vertices of K. If we now choose v(x)=1 in (4.8), 
in which case clearly v=av, we get 


3 
1= ZA(x), xeK, 

i=] 
since in this case pj =R,=0. This proves (4,9) and (4.11) follows directly. 


To prove (4.10) we choose v(x)=dix1 +d2%2 in (4.8) with de R. Again v=av 
and further 
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pi) =dy(al—x1) + do(ah—29), 
and R;=0 so that by (4.8) 


3 
vxJ=Hv(x)+ £ [di(aj—x1)+do(as—xz)JAi(x), xe K. 
int 
and so for all deR we have 


3 
z [di(aj—x))+d2(a5—x2)] AOQSO xeK. 
iz 


This proves (4.18) by choosing 4=2e (x). i=1, 2. Finally, (4.12) follows in 
Xi 


a similar way by choosing v=d1x)+dox2 in (4.13). This finishes the proof of 
the lemma and the proof of Theorem 4.1 is complete. O 


Since Theorem 4.1 states estimates of the interpolation error using the 
L.»(K)-norm, it is not ideally suited to give estimates for Ju=zthullreey 
involving the L2-norm. For this purpose we will use instead the following 
analogue of Theorem 4.1. [lere we use the following notation for r=0, 1, 
2 


Ivhr@=( = f[Deviedxy'2. 
lel mr oo 
Note that halturen measures the L2(Q)-norm of the partial derivatives of v of 
order exactly equal to r, whereas derivatives of order less than r are not 
included. We say that |: [H'(<) is a seminorm. Since we may have lv|q4(aQ)=0 
even if v#0 (eg if v=! and r2=1), it is not a norm. 


Theorem 4.2 Under the assumptions of Theorem 4.1 there is an absolute 
constant C such that 


|[v—avilLcc)SCh Zl vleeK). 
he 
lv—avln'eKy SC on via. 


We see that Theorem 4.1 and 4.2 have exactly the same structure. the only 
difference being the norm involved, either the La or the L2-norm. For 
simphiity we have chosen to present a proof in the L.-case since we then avoid 
some technical complications (for a proof of Theorem 4.2, see [DS]). 


Let us now apply Theorem 4.2 to estimate the global interpolation errors 
Ju —spullt. cay and [u—syUl pq). We have by summing over KeTp, 
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lu-saulPcay= > lull cay S z Cr glulirag 
KeT,, KeT, 


; 
<Cht > \ulfe ag = Ch ulfegsy. 
Ket, 


and similarly using (4.1), ic, 2& < 4, 


ex 6B 
“ 4 2 
(4.16) u-myl Ways B,C TE wltiws 2, Sittin 
= Saat 
so that 
(4.17) |Ju—suyu| [Hays a julH?(2)=Chluligyay, 


if the constant fi is included in the constant C, and 


(4.18) |Ju—stnul [Lay SCh?JulH’ (a). 


4.3 Interpolation with polynomials of higher 
degree 


The estimates (4.17) and (4.18) are typical examples of estimates for the 
interpolation error u—zyu, in this case for interpolation with piecewise lincar 
functions. If we work with piecewise polynomials of degree r=1 on triangu- 
lations Th satisfying (4.1), we have in the typical case the following estimates: 


(4.19) [Juma |LaySCh"* Jul yay, 
(4.20) ju att] Hay Chul Ha). 


where the constant f is absorbed in the constant C in (4.20). If V,c H2(Q), 
then we also have 


(4.21) Ju-snulH?¢aySChulyr(ay. 


Note that for each derivative of the error u-m)u, the power of h on the right 
hand side drops by one. Note that the constant C in (4.19)-(4.21) only depends 
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on the constant B in (4.1) and the degree r, but not on the mesh parameter 
h or the function v. 


Remark 4.1 If u does not have the regularity required in (4.19) or (4.20), we 
get the corresponding reduction in the power of h: For 1<s<r+1, we have 
(4.22) fu saulle.gaySChluj ray. 

(4.23) Jlu—stnullreoay<Che"luleryay. 

Example 4.1 Let {Th} be a family of triangulations Th={K} of QcR? 
satisfying (4.1) and let Va={veC%(Q): vlkeP2(K), WKeT)}. For the finite 


element of Example 3.2 we may for veC%(Q) define the interpolant mye Vp 
by 


aAnV=v at the nodes of Th, 
T,¥=Vv at the midpoints of the sides of Ty. 


In this case (4.19) and (4.20) hold with r=2. O 

Example 4.2 With Ty={K) as in Example 4.1 define Vn={veC!(Q): 

v|kePs(K). WKeTh} and for veC?2(Q) specify the intcrpolant mveVp by 
D®m,v=D*v at the nodes of Ty. [a|<2, 


LS oN at the midpoints of each side $ of ‘Ih, 


én on 


where < denotes differentiation in the normal direction to S. In this case 


(4.19)-(4.21) hold with r=5. 


4.4 Error estimates for FEM for elliptic problems 

Recalling again the typical abstract error estimate for an clliptic problem 
|Ju-upllysC|lu-v||v VveV), 

and choosing here v=a,u with mpyue Vp and interpolant of u, we have 


(4.24) Hu—unllysClu—spully YveVn. 


9) 


Using estimates for the interpolation errur |lu—mpully we then obtain 
estimates for the finite element error ||u—uj||y. Using the interpolation 
estimates of Sections 4.2 and 4.3 we have for example the following error 
estimates: 
Example 4.3 With V=H\(Q) and (cf Examples 3.1-3.3) 

Va={veV: viceP,(K), VKeTh}, r=1. 2, 3. 
we obtain from (4.20) and (4.24) 

|]u-upllH'caySCh*lulte (a) 
for the finite element method for the Dirichlet problem (1.16). We obtain a 
similar result for the Neumann problem (1.36). 
Fxample 4.4 With V; as in Example 4.2 we have for the biharmonic problem 
of Example 2.5 the following estimate 

JJu—unllegeySCh4{ul yay qo 
Remark 4.2 It is possible to prove analogues of (4.24) in norms other than 


that given hy the space V. For example one can prove for the finite element 
method of Section 1.4 that (see [RS]) 


I|Yu-Vualle.caysC]] Vu- 9 apulle cay, 
which together with Theorem 4.1 gives 


ul 


(4.25) (Mu VuallrcaysC max [hk max [|D“ul|t ay). 0 
Jul=2 


4.5 On the regularity of the exact solution 


We have seen that the regularity of the exact solution u is involved in 
estimating the error |/u—unlly in the finite element method. Let us now give 
a typical result that shows how the regularity of the exact solution u depends 
on the regularity of the given data. Let us then consider the Poisson equation: 


-—Au=f in Q, 


4,2 
(4.26) u=0 on’, 
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where Q is bounded domain in R? with boundary [ and f is a given function. 
Let us first assume that [ is smooth, ie, T is a smooth curve in particular 
without corncrs or cups. In this casc there is for s=0, 1, . ., a constant C 
independent of f such that 


(4.27) HullHe?(ay Cll flaca. 


ie, if feH*(@) then uc H?*2(Q), or loosely speaking. we “gain two deriva- 
tives” in (4.26). 

It Tis not smooth, then (4.27) may not hold, not even for s=0. If T has 
a corner, then the solution u or derivatives of u will in general have 
singularities at the corner even if f is very smooth (f€ HS(Q) for s large), More 
precisely, the solution u of (4.26) with tf smooth basically has the following 
form close to a corner with angle w (cf Problem 4.6): 


(4.28) u(r,8)=r¥a(8)+B(r.0). y= a 
where @ and 6 are smooth functions (here we use polar coordinates (r, 8) with 
the pole at the corner), It is easy to see that if w>7 then a function u of the 
form (4.28) does not belong to H?(82) if a0, On the other hand, one can 
show that (4.27) holds with s=0 if Q is a convex polygonal domain (in which 
case the corner angles satisfy w<7). 

For the biharmonic problem (2.22) we have if the boundary T is smooth, 
for s=0, |, 


hulle (ay Cll fll a) 


If [ has corners there are results analogous to those just stated for the Poisson 
equation (4.26). 


Example 4.5 For a solution u of the form (4.28) we have formally that 
u€H(&2) derivatives Dou of order s belong to LQ2)6 


R 
J|Dsul2dx ~C f[r’-sPrdr<~, 
Q i?) 


Hence ueH*(Q) if and only if s<y+1. By Remark 4.1 we thus have for the 
standard finite element method of Section 1.4 for the Poisson equation in a 
polygonal domain that for any e>0 


(4.29) |[u—apllret(ayssChY~ ful |p gy = Chie, 


where y=n/w and w is the maximal angle of a corner of [, For example if y="/, 
which corresponds to a concave corner of angle 3n/2, then 
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Roe 


3 
|u=uplly'aysCh 
We see that in this case we do not obtain the full rate of convergence which 
is O(h). O 


4.6 Adaptive methods 


If the exact solution u has eg a corner singularity, then it is natural to refine 
the triangulation close to the corner to increase the accuracy, Recalling that 
for the method of Section 1.4 (cf (4.16)) 


(4.30) fu-upli'(ayS|u—shul yoy SC{E(hylul ray)? ]"?, 


it is clear that we somchow would like to balance the size of hy with that of 
julH(K) and in particular choose bx small where |ulp¢K) is large. If u has the 
form (4,28) with 0<y<1, then one possible refinement is given by (cf Problem 
4.4.) 


(4.31) hy=Chdb-’, 


if hk <dx, where dx is the distance from K to the corner and h is the mesh 
size away from the corner. With this refinement we have, disregarding the 
g, 


(4.32) fu—uple'yay<Ch. 


Notice that the total number of elements with a refinement of the form (4.31) 
is of the order 0(h7?), i.e., the same as with a uniform mesh of size h, Thus, 
in this case the retinement does not increase the total number of unknowns 
significantly but significantly increases the precision (from (4.29) to (4.32)). 

In general the nature of the exact solution u is not known beforehand and 
then it is not clear how to locally refine the finite element mesh. Recently 
methods for automatic mesh refinement, so-called adaptive methods, have 
been developed which do not require the user to supply information on the 
smoothness of the exact solution. In these methods this information is instead 
obtained through a sequence of computed solutions on successively refined 
meshes. 

‘lo very briefly describe some of the basic ideas underlying adaptive 
methods, suppose 6>0 is a given tolerance and suppose we want to obtain a 
finite clement approximation u, such that 


(4.33) Ju-upliyey s 8. 
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Relying on the error estimate (4.30) we sce that (4.33) will be satisfied if the 
corresponding finite element mesh Th={K} 1s chosen so that 


(4.34) EZ (helulueog)?~(2)2. 
KeTy Cc 

To determine a mesh satisfying (4.34) we may proceed as follows: Choose a 
first mesh Ty={K} and compute a corresponding finite element solution ay. 
Using Gy compute approximations to [u|}?(R) denoted by \ayl44R) for KeTy. 
The quantity JOnhHe Ry may be obtained using difference quotients based on 
the values of Vup at the centers of gravity of K and neighbouring triangles 
in Tp. Next, construct a new mesh Ty={K} by subdividing into four equal 
triangles each Ke Ty for which 


2 
he Pape so 28 
(bRfGnl HAR)? > he 
where N is the number of triangles in Th. Next, compute the finite element 
solution uj; on the new mesh Ty and repeat the process until 


(4.35) E (hxlunlwtagy? <(2)2. 
KeT, Cc 


Note that by the construction if follows (if 3 is smal} enough) that for the final 
mesh Ty satisfying (4.35), all the terms in the sum will be approximately equal 
Note also that after refinement of certain triangles, the resulting mesh is 
completed into a triangulation as in Fig 1.15. 

It is also possible to control the error in other norms than the H!(Q)-norm 
used in (4.33), for instance we may want to control the gradient error in the 
maximum norm. In this case we base the adaptive method on the error 
estimate (4.25) and seek to find a mesh T,={K} such that 
(4.36) ChxmaxlCunh.¢x)~ & VKeTy. 

jal=2 
where as above ||D¢upll_cx) is a computed approximation of ||D°ull.,«). 
Again the final mesh satisfying (4.36) is constructed through a sequence of 
successively refined meshes where triangles K for which the left hand side of 
(4.36) is larger than 5 are refined. In Fig. 4.2 we give the sequence of meshes 
(with a zoom at the origin for the final mesh) obtained by applying an adaptive 
method of this form with 6=0.1 and C=1 to the problem 


Au=0 in Q, 
u=ug on T, 


95 


where Q={x=r(cos@. sin®y, O<r<l, 0<0< 3/4} with exact solution 
u(r, @)=r’sin(y0), y=4/3. In Fig. 4.3 we give the actual gradient error 1 Vetx)| 
as a function of the distance |x| to the origin along the radius 6=a/2. We 
observe that the gradient error is roughly equal to the tolerance and thus we 
see that the adaptive method is able to find a good inesh in this case. This 
example is taken from [EJ2], where theoretical and computational results for 
adaptive methods of the indicated type are given, see also {E]. 

For adaptive methods for parabolic problems we refer to Section 8.4.4, For 
another approach to adaptivity, see [BR], [BM]. 
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Fig 4.2 Sequence of meshes abtained by adaptive FEM 
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Fig 4.3 Gradient error 
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4.7 An error estimate in the L2({2)-norm 


We have seen that if we apply the finite element method with the space 
Va= (ve H\(Q): vikePi(K), VKETh) to the Poisson equation (1.16) with 
a polygonal domain, then we have the following estimate for the error u—up 
in the H!(Q)-norm: 


(4.37) Juan) iitaySChl alee). 
This trivially gives the following L2(Q)-estimate: 
(4.38) {lu-up! tee) =Chlul Hay. 


On the other hand by (4.18) the interpolation error, u-mpu, satisfies the 
second order estimate: 


uw mulleg2ySChJuly(ay. 


We shall now prove that we have a similar estimate for ||u—uy{{p,cay So that 
this quantity in fact converges at the optimal rate. We shall then assume that 
the polygonal domain Q is convex (if Q has a smooth boundary, then convexity 
is not required), 


Theorem 4.3. lf Q is a convex polygonal domain and up ts the finite element 
solution of the Poisson equation (1.16) with piecewisc lincar functions, ie up 
satisfies (1.20), then there is a constant independent of u and h such that 


Ne unlle cay SCh7{ul wary. 


Proof. Subtracting (1.19) and (1.20) we obtain the error equation 
(4.39) a(e,v)=0 VveVy, 


where e=u—uy and the notation of (1.19) is used. We shall now estimate 
(e, €)=|lell? cay using a so-called duatity argumem which is often used in finite 
element analysis (see also Chapter 8). Let @ be the solution of the following 
auxiliary dual problem: 


—Ag=e inQ, 
g=0 on I, 


Since Q is convex we have from (4.27) with s—0, 


(4.40) ella SClleliiycay. 
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where the constant C does not depend on e. Using Green’s formula and the 
fact that e=0 on T, 


(c, c)=—(e, Ap)=ale, #)=ale. Pang), 


where the last inequality follows from (4,39) since MPPEVH SO that 
a(e, mp) =0. Applying now the interpolation estimate (4.18) to p and using 
also (4.40), we find 


iJell 2c) <Hlell esa Hp ene e4 y= lle ee coy hl ele) 
<Chilelli@llellu re). 

Dividing by |lelli.¢a) and recalling (4.37) we finally get 
\lell.caySChllellix'cay=Ch"u| (2) 


and the proof is complete. 0 
Remark 4.3 ‘The basic stability inequality (2.6) for (4.26) states that 


(4.41) llulliteae 
where A is any constant such that 

ILOM=KE YHSAl lwo We Hi). 
The smallest possible choice of A is given by 


(4.42) A= sup St I ; 
ve Hic) !lla'ea 


v#0 
Clearly the quantity A defined by (4.42) measures the size of f in a certain 
sense and in fact we may define a norm || -!lu-'(ay by 
li, v)) 
(4.43) [Ifln-ya= sup a. 
veH}(a) lheay 
v#() 


This is the norm in the so-called dual space H~!(2) of Hj(Q). Note that 


(ct. v)I 

IIfllngay= SUP 
veL(2) IIv[L@) 

v#0 
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and since we take sup over a larger set. we clearly have that ||-I}p ay is a 
stronger norm than || ||H-"¢q). ie. 


[Illi 'cy sitll ce): 
By (4.42) and (4.43) it follows that the basic stability inequality (4.41) may 
be written as 


clip 
| ulliteayss = lithe 142). 


which formally corresponds to (4.27) with s=—I, 9 


Probiems 

4.1 Let [=[0. h] and let mveP (J) be the linear interpolant that agrees with 
veC%(1) at the end points of 1. Using the technique of the proof of 
Theorem 4.1 prove estimates for ||v— vj} and {lv’—(av)'||L cp. cf 
(1.12) and (1.13). 


42 Prove (4.14). 


4.3 Estimate the error |[u—unliH?qy for Problem 1.5 and Example 2.4. 


4.4 Prove thal the total number of elements with a corner refinement of 
the form (4.31) is O(h72). 


4.5 Determine a suitable refinement in case the exact solution has a 
singularity of the form (4.28) with 1<y<2 and we want to control 
I Vu- Vue cay via the estimate (4.25), cf [EJ2]. 


4.6 Using polar coordinates (r, 6). let Q={(r, 9): U<r<t, O<0<w} be 
a pie-shaped domain of angle w. Prove that the function u(r, 0) 


=rvsin (6), y=, satisfies: Au=0 in Q, u=() on the straight paris 
w 
of the boundary ot &2. 


4.7 Prove, by modifying the proof of Theorem 4.3, the Iullowing L3- 
estimate for the standard finite element method of Section 1.4 for 
Poisson’s cyuation on an L-shaped domain (cf Example 4.5): 

ifu-unl le ¢aySCht 3. 


4.8 — Let Vp be a finite element space on a triangulation Ty of the domain 
OcR¢ satisfying (4.19). Given ueL2(2) let uneV, be the L2{Q)-pro- 
fection of u onto Vy, ie, 
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(4.44) (un, v)=(u, v) VeVi, 


where (.,.) is the scalar product in L2(Q). Prove the error estimate 


|]u—upl[L,caySinf [lu-v]fL,caySCh"* "uly (ay, 
veMy, 
and that 


[unt cay{lulli.(a). 


5. Some applications to elliptic 
problems 


This chapter presents applications of the finite element method to sume busic 
problems in continuum mechanics of elliptic type. We first give suitable 
variational formulations of the continuous problems. 


5.1 The elasticity problem 


Consider a homogenous isotropic elastic body B occupying the bounded 
domain Qc R? with boundary I decomposed into two parts Ty and T2 with 
the area of Tz being positive. Let B be acted upon by a volume load f= 
(fy, f2, f3) and a boundary load g=(g:, g2. g3) on I. where the fi and gj are 
the components in the x;-direction. Further, lct us assume that B is fixed along 
[3 (see Fig 5.1). 


Fig 5.1 


We want lo determine the displacement u=(u1, ty, us) and the symmetric 
stress tensor o=(9;j), 05=9;j, i, j=1, 2, 3, under the loads f and g. Here uj is 
the displacement in the x-direction, oj is the normal stress in the x;-direction, 
and the oy, i#j, are the shear stresses. Further, €(u)=(ei;(u)), where 
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ee oe ee 


28x; Ox; 
is the deformation (tensor) associate with the displacement u. Assuming that 
Bis linearly elastic and that the displacements are small, we have the following 
relation between stresses and deformations. or constitutive relation (Hooke’s 
law): 


(5. 1a) oy=A div u 8, +pHE;,(u), 
where 4 and u are positive constants, 


3 ri 
divu= = SY 
i=1 OX, 


UZ { lif i=j, 
TL Oif 14). 
We also have the equilibrium equations 
4. 3a: 
(Sb) — & S=5, in Q, i=1,2.3, 
j=1 Xj 
together with the boundary conditions 


(S.1¢) u=0 on [, 
3 

(5.14) = oynj=B onl), i=1, 2, 3, 
jel 

where n=(n,) is the outward unit normal to I. 


Remark The constants wand v in (5.1a) can be expressed as 
E Ev 


pete ee eee ee. 
ltv (l+v) (1-2) 

where E is the modulus of elasticity (Young modulus) and v is the contraction 
ratio (Poisson ratio) of the elastic material of B. OQ 


In the remainder of this chapter the following notation for partial derivatives 
will be used 


We shall also use the summation convention that repeated indices indicate 
summation from 1 to 3. With this convention we may write the equilibrium 
equations (5.1b) as follows 
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-oj=f in Q, i=1, 2, 3. 


We will now give a variational formulation of the elasticity problem (5.1). 
Let us then first note the following Green’s formula: 


(5.2) J ojjei(v)dx= J aynyjds—f 0,5 vidx, 
Q r Q 
where the summation convention is applied in all terms, ie, we sum over i 


and j from | to 3, To show (5.2) observe that since 0,=0), and ei) =5 (¥4,j 
+v; i), we have 


1 1 
aye) = Floyrigt HIV D= 5 (Ory tM) = OHM) 


Hence, by Green’s formula (1.17) we get 

Joey (v)dx =f ov; jdx= Jojgn,yids— J oj jvidlx, 

Q rey r a 
which proves (5.2). Let us next choose a test function v=(vj, v2, Vs) €[ID(Q))° 
(ie each component v;e¢H!(Q)) such that v=0 on T3, multiply (5. 1b) by vj, 


sum over i from | to 3 and integrate over Q. By Green's formula (5.2), we 
then have 


Sfividx=— fog jvidx= J ojje;\(v)dx— foinjvids, 

2 a 2 i 
where the boundary integral over [2 vanishes since v=0 on [2. Using also 
(S.1d) we thus have 

Joyei(v)dx= J fividxt fgivids. 

Q Q is 


Finally, we eliminate oj by using (5.1a) to get 


STA div u div v+pej(uej(v)jdx= ffividx+ fgivids, 
Q 2 i 


since 
div u dije(v)=div u div v. 


We are thus led to the following variational formulation of the elasticity 
problem (5.1): Find ue V such that 


(5.3) a(u, v)=L(v) Vvev, 


where 
a(u, v)=[[A div u div v+pey(u)ej(v)]dx, L(v)= Jfividx + Jawvids, 
Q a q 


V={ve[H!()]*: v=0 on T}. 
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Let us now check if the assumptions (i)—(iv) of Section 2.1 are satisfied in 
this case. We can routinely verify (i), (ii) and (iv) and thus it only remains 
to prove the V-ellipticity, 1¢, to prove that there is a positive constant a@ such 
that 


(5.4) a(v, vyzullvllg  VveV. 


where 
3 
‘vllv=llolli'cay=C_ © lvl sey)" 
fe 


This inequality follows directly from Korn’s inequality: There is a positive 
constant ¢ such that 


(3.5) Seiyen(v)ax 2ellvG=cUMi Fray tlIMlia)- 


We notice in particular that (5.5) amounts to proving that the I.)-norm of any 
partial derivative vj.) can be estimated by the L2-norms of the deformations 
£)j(¥) involving only certain combinations of the v,,). Since (5.5) involves also 
the L2-norm on the right hand side, we need [> to have positive measure (cf 
Example 2.7). For a proof of Korn’s inequality we refer to [Ni] (the proof is 
easy in the case P'2—T, cf Problem 5.2). 

Now we are able to formulate a finite element method tor our elasticity 
problem. Let then Ty={K} be a “triangulation” of Q into tetrahedrons K as 
described in Example 3.6 and define 


Vn=(veV: v[Ke[P(K)]3, WKeTh}. 


Each component y; of a function veV}, is thus a piecewise linear function 
vanishing on [2. We now formulate the following finite element method for 
(5.1): Finu u,e€Vp_ such that 


a(un, v)=L(v) WvevVh. 


According to the general theory of Chapter 2 this problem has a unique 
solution and by the interpolation results of Chapter 4 we have the following 
error estimate: 


IJu-unlleay<SCh]uli ero). 


Problems 

31 Consider the elasticity problem (5.1) in a three-dimensional domain 
Q=Qx(-F, £) with QCR?, ¢ small and F3={,=0. This corresponds 
to a thin elastic plate with middle surface Q subject to in-plane loads 
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only (no transversal loads). Assuming “plane stresses” (ie, 4)3=0, 
i=1, 2,3) prove that in this case (5.1) is reduced to: 


oj=Aer(u) +es2(u)) jth elu), i, j=1, 2, 
2 oy e Ee eit 

- = =f, inQ, i, j=1, 2, 
ata j 


up=uz=0 onl), i=l, 2, 
2 a 
Lojnj=F, onlz, i=l, 2. 
j=l 


Ey -__E 
ues) 
boundary of @ and f, and F, are given forces. Give a variational 
formulation of this problem and formulate a corresponding finite 
element method. Determine the stiffness matrix in a problem with 
simple triangulation and piecewise linear displacements. In Fig 5.2 
below we give the computed displacements using bilinear clements on 
the indicated triangulation for the above problem corresponding to a 
thin plate fixed at both ends and subjects to a distributed load as 
indicated. The Young modulus E is here different in the upper and 
lower halfs of the plate denoted by | and I], with E being larger in Il. 


where k= , T, and [ is a decomposition of the 


§.2, Prove Korn’s inequality in the case T2=[. Hint: Prove that 


Soy idx = fyi ivj idx. 
Q Q 
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5.2 Stokes problem 


The stationary Stokes equations for an incompressible Newtonian fluid with 
viscosity 1, enclosed in the domain 2c R3, and acted upon by the volume load 
f, read in the notation of Section 5.1: 


Oy =2ye,(u)—p 5, in Q, 
~Gy=h in Q, 
div u=0 in Q, 

u=0 on QF 


where o=(gj)) is the stress, p the pressure and u=(u;) the velocity. Eliminating 
i; we obtain the following equivalent formulation: 


(5.6a) —uAu+pj=fi in Q, i=1, 2,3, 
(5.6b) div u=0 in &, 
(5.6c) uj=0 onl, i=}, 2,3. 


We now seek a variational formulation of (5.6). Let ve[H}(Q)} be a test 
funciion satisfying the incompressibility condition div v=0 in Q, multiply 
(5.6a) by v,, integrate over Q and use Green’s formula. Then summing over 
i, we get 


Sfividx=—pf Aujvidxt Jp ividx 
Q Q a 
=-f BUY s+ uf Vu,- Vv,dx+ fpniv,ds— fpv,.dx 
r on Q r Q 
suf Vu;. Vvidx, 
Q 


since v;=0 on T and div v=v;,;=0 in Q. Thus we are led to the following 
variational formulation of the Stokes problem (5.6): Find ueV such that 


(6.7) a(u, v)=L(v) Wev, 


where 
a(v, w)=pf Vvj- Vwidx, 
Q 
L(v)= ffividx. 
2 
V={ve[H)(Q)]*: div v=0 in Q)}. 
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We can easily check that the conditions (i)—(iv) of Section 2.1 are satisfied. 
Note that in the formulation (5.7) the pressure p has “disappeared”, which 
comes from the fact that we are working with a space V for the velocities where 
the incompressibility condition div v=0 is satisfied. 

To formulate a finite element method for (5.6) based on the variational 
formulation (5.7) we need to construct a finite-dimensional subspace Vj», of 
V. It turns out that this is not altogether easy since we have to satisfy the 
condition div v=0 exactly. For simplicity, let us consider the analogue of (5.7) 
in two dimensions, in which case 

3 Bv2 


<i Wy 
V= {(v=(v1, va)e[Hi(Q)]?: div v= aa + a 


=U in Q}, 


where QcR?. By a standard result in advanced calculus it follows that if 8 
is simply connected, ie, if © does not contain any “holes”, then div v=0 in 
Q if and only if 
ap _ 3p 
v=rot =| —-, -—— ], 
bd | 8x2 Oxy 


for some function @. More precisely (cf Problem 5.1), one has 
(5.8) veVev=rot @, peH}(Q). 
The function @ is the stream function connected with the velocity field v. 
Let now Wy, be a finite-dimensional subspace of HQ) eg constructed using 

the Cl_element of Example 3.5 and define 

Vn={v: v=rot @, peWn}. 
Then Vac¥ and formulating a finite element method in the usual way by 
replacing V by Vp, in (5.7) we obtain a discrete solution up satisfying the 
following error estimate: 

[u—unllio,Ch4 |u| (a). 


Chapter 11 gives other finite element methods (so-called mixed methods) 
for the two dimensional analogue of the Stokes problem (5.6) nut requiring 
the velocity space Vp to satisfy the incompressibility condition exactly. 
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5.3 A plate problem 


Consider a thin elastic plate P with middle surface given by the domain Qc R?2 
with boundary F and acted upon by the transversal Joad f, see Fig 5.3. 


Fig 5.3 


We seek the transversal deflection u together with the moments oj, i, j=1, 
2, under the load f. Here oj; is the bending moment in the x; — direction and 
012= 0) the twisting moment. Assuming small deflections and a linearly elastic 
material, we have the following constitutive relation (cf Hooke's law): 


(5.9) oj=AAu djtprij(u), i=1, 2, 
where i and u are positive constants, and 
eu 
a hala ior 


defines the curvature tensor. Further we have the following equilibrium 
equation: 


(5.10) 


where again the summation convention is used. 
To define the boundary conditions let n=(n,, nz) be the outward unit 
normal to I, t=(ty, t2)=(n2, —n)) the tangent to I and define 
du 


an ati (normal derivative), 
n 
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=U jt} tangential derivative), 

a? ? 

San= GYD; (normal moment), 

Oat =Oijnit (twisting moment), 
3Ont 

R(o) =o, jni+ (transversal torce). 


at 


Now let the boundary [° be partitioned into three parts Tj, i=), 2, 3, and 
consider the following boundary conditions: 


(5. 11a) n=Sbag on I, (clamped), 
én 
(S.11b}  u=G_n=0 on [2 (freely supported), 


(5.1Lc) Onn=R(a)=0 on Ty (free boundary). 


Let us now give a variational formulation of the plate problem (5.9}-(5.11). 
Let veH?({2) be a test function satisfying the essential boundary conditions 


oh 
Saar onl), 
(5.12) v=0 on 12. 


If we now multiply the equilibrium equation (5.10) by v and integrate over 
Q. then repeated usc of Green's formula gives 


(5.13) Jfv dx= Joijv dx=foyjniv ds foi jv.dx 
Q Q r Q 


=Jajyjniv ds— Saynyv ids + foyxi(v)dx. 
T . 2 


Since 
3 cS) 

viz 3, nit ah i=1, 2, 

we have 
ov av ov av 
oyniv =oxnin; 2 4 ont) 2 =Gon + on 2, 
ANY §= ODN an ot 0 On Laer 

so that (5.13) can be written 
(5.14) foijey(vldx= ftv dx— Joy nv dst Sonn Was + fon 2¥ds 

Q Q c rn r 3t 
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If the boundary Tis smooth, partial integration along I gives Problems 
5.2 Prove (5.8). 
Son BY gon — fp 2Onty ds. 
T 


at r 3t 5.3. Show that the analogue of Stokes equations (5.6) in a two-dimensional 
7 : ‘ aod simply connected domain Q can be formulated as the biharmonic 
in which case (5.14) can be written problem (2.22) by introducing the stream function as unknown. 
Joixi(vdx= ftv dit Jon Yas {R(@)v ds. 5.4 Show that the plate problem (5.9}(5.11) takes the form (2.22) if 
Q 2 ie ii) Tr ly=!. 


If we now use the boundary conditions (5.11) and (5.12), we sce that the 
boundary integrals disappear and on eliminating oj also, by using (5.9). we 
finally get 


sau Avs wori(ubeg(vy]dx= J fv dx. 
Thus we are Jed to the following variational formulation of the plate problem 
(5.9)-(5.11): Find ueV such that 

a(u. vy)=L{v) Vvev. 
where 

auv)=[PAudv+ wry(uyey(y)]dx, 


L(v)= [fv dx, 
Q 
V=(veH(®): v= *=9 on Ty. ¥=0 on F3}. 


We immediately see that the conditions (i), (ii) and (iv) of Section 2.1 are 
satisfied and it is possible to verify the V-ellipticity for example in the case 
when the length of Ty is positive, ie, when P is clamped along a part the 
boundary (cf Problem 2.2). 

We can now ina routine way formulate a finite clement method for the plate 
problem using the C!-element of Example 3.5. We leave the details to the 
reader. 


Remark The constants 4 and \1 in (5.9) are given by 


2. Ea us vEa3 
1%1+v)) © 12(1-v2)’ 


where E is the Young modulus and v the Poisson ratio of the elastic material 
of the plate. O 
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6. Direct methods for solving 
linear systems of equations 


6.1 Introduction 


We have seen in Chapters | and 2 that application of the finite element method 
to a linear elliptic problem typically leads to a linear system of equations 


(6.1) AE=b, 


where A=(ajj) is a symmetric, positive definite and sparse MxM matrix, and 
beR™. We also know that the unique solution GERM of (6.1) can be 
equivalently characterized as the solution of the quadratic minimization 
problem 


(6.2) Min it yn An-b-n]. 
neRM 2 
To compute the solution € we can start either from (6.1) or (6.2). In this 
chapter we shall study some direct methods, or methods based on Gaussian 
elimination, for the solution of (6.1). In the next chapter we shall study some 
minimization algorithms for the solution of (6.2) that may be viewed 
equivalently as iterative methods for (6.1). 


Remark Finite element methods for first order hyperbolic problems typically 
lead to non-symmetric linear systems of equations, see Chapter 9 below. In 
this case there is no associated minimization problems (unless a least-squares 
formulation is used) and it js not yet clear how to construct efficient iterative 
methods for general classes of non-symmetric problems. Thus, for such 
problems Gaussian elimination (with pivoting, cf below) is often uscd. O 


6.2 Gaussian elimination. Cholesky’s method 


We recall (cf any basic course in numerical analysis) thal using Gaussian 
elimination to solve (6.1), we obtain a LU-faciorization of A of the form 
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(6.3) A=LU, 


where L= (lj) is a lower triangular MXM matrix (ie, lij=Oif j>i), and U=(ujj) 
is an upper triangular matrix (ie, ui=0 if j<i), or diagrammatically: 


xX XK 
x 


ere) 
x 


From the factorization (6,3) it is easy to solve the system AE=b by using 
forward and backward substitution 10 solve the triangular systems: 


(6.4a) Ly=b, 
(6.4b) US=n. 


We recall that U=A™) where the matrices A®,k=1,.. 


-, M, are successively 
computed as follows: 


(i) AD=A, 
(ii) Given A) of the form 
(ky (KO) 
as cheaters 
ke [ot oO alk 
A@e | oa ae 
k (k 
0 0 ale ae 


(k+1) i=l, .,k,or 
ag Aap j=l... ., k= 
(6.5) 
kL k = 
al t baal) — aft) i=k+1, ., M, and 
AK 
J=k,. .,M, 
under the assumption that al #0, 
We also recall that L=(4j), where 
f= 1, i=t,...,M, 
aft 
ly= — “ik, isk+1, ..,M,  k=1, M 
»M, 
kk 
In=0, if i<k. 


One can show that if A is symmetric positive definite, then al) >a, 
k=1,...,M. Thus, Gaussian climination can be performed without pivoting. 
In addition, under the same hypothesis it is not necessary to perform pivoting 
to prevent numerical instability due to too small pivot elements af). Thus, 
we may perform the Gaussian elimination in any desired order. We will see 
below that different direct methods for (6.1) essentially differ in the choice 
of the order of the elimination, ie, the enumeration of the nodes in case we 
perform the elimination according to the ordering of the nodes. 
Since A is symmetric positive definite we may alternatively factur A as 


A=BBI, 
with B=DL and where D is a diagonal matrix with diagonal elements 
dix= Val) k=1,...,M, 


and L and att) are obtained through the Gaussian elimination given above. 
Here BT denotes the transpose ot the matrix B. The elements bi, of the matrix 
B can alternatively be determined using Cholesky’s method as follows: 


bu=Vau, 
byt, i=2,. 0M. 
bu 


and for j=2,...,M, 

j-l 
by=| ay- = bi 
| AT Pik 


jal ans 
Dy=(ay— & dikdjx)/dj, isj+1, ,M. 


6.3 Operation counts. Band matrices 


The number of arithmetic operations to obtain an LU-factorization of a dense 
MXM matrix (ie matrix with few zero elements) is asymptotically of the order 
M23, If the matrix is sparse, then it is possible to greatly reduce the number 
of operations by using the sparsity. This is particularly easy to do if the matrix 
Aisa band matrix, ie, there is a natural number d, the band width, such that 


a= if li—j}>d. 
A band matrix has the following form, where the shaded area indicates where 


non-zero elements may occur (some elements in the band may be zero): 
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To factor an MXM matrix with band width d one needs asymptotically Md2/2 
Operations (cf Problem 6.1), which is much less than the number M3/3 for a 
dense matrix if d is much smaller than M. 

In our applications when aj=a(qi, 69), where a(. . .) is a bilinear form and 
{$1, .. -. Gu} is a basis for a finite element space Vp, we have that 


d=max {|i—j]: @; and ) are associated with vegrces of freedum 
belonging to the same element}, 


Clearly, the band width depends on the chosen enumeration of the nodes, and 
thus if Gaussian elimination is to be used, then we want to enumerate the 
nodes so as to make the band width (nearly) as small as possible. 


Example 6.1 Let us consider the following enumeration giving minimal band 
width 


10 50 


a 


In this case we have d=5 (assuming one degree of {freedom associated with 
each node) With a horizontal enumeration instead we would haved=10, 


Vs 


Example 6.2 Ina typical application with a uniform triangulation of the unit 
square with mesh size h and node enumeration according to Example 6.1, we 
have that M=0(h~2) and d—0(h7!), and thus the work estimate for Gaussian 
elimination is in this case 0(h~4) or 0(M?). 


Note that a band matrix A is stored as a vector with eg the columns in the 
band in consecutive order. If A is also symmetric then only cg the upper 
triangular part of A needs to be stored. Thus, if A=(aj) is a symmetric band 
matrix with band width say 2, then A may be stored as a vector a=(aj) with 
the elements aj corresponding to the matrix elements ajj as follows: 


ay a, ag O 0 0 
azo as)ay:_—sCOO 0 


A= ag ag ayo O 
sym ap ay, aa 
ay? 


Remark It is sometimes convenient to allow the band width to vary from one 
column to another. To store A in this case, we again store the columns of the 
band consecutively in a vector a=(a;), We then also have to supply information 
concerning the indices of the diagonal elements. As an example, a matrix A 
with the following variable band structure 


fay ap ay O i) 0 0 
a3 as ay aw O 0 
ae ag ay O 0 
A= a ay O 0 
sym a3 ay «O 

als Alo] 

ai7 

can be stored as the vector (aj, +, ay7) together with the list of indices of 


diagonal elements (1, 3, 6, 9, 13, 15, 17). This is referred to as a skylinemethod 
of storage. O 


6.4 Fill-in 


Using (6.5) it is easy to see that if A is a band matrix with band width d, then 
so are the factors Land U in an LU-factorization of A. However, the matrices 
L and U may have non-zero elements within the band at locations where the 
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elements of A are zero. This is called fill-in. In the applications most of the 
elements of A within the band are zero (see eg (1.25)), while with usual 
ordcrings such as in Example 6.1, most of the elements of the factors L and 
U within the band are non-zero. Thus, the factors L and U contain many more 
non-zero elements than A and we have a considerable fill-in. Different 
enumerations of the nodes may give different degrees of fill-in, cf the nested 
dissection method below. Notice that the density of the factors L and U 
influence the cost of the backward and forward substitutions (6.4a, b). With 
most of the elements non-zero within the band, as is typical with usual 
orderings, this cost is O(Md). 


We will now briefly consider some common variants of Gaussian elimi- 
nation, namely the frontal method (cf [1]) and nested dissection (ct [Ge]). 


6.5 The frontal method 


In this method the assembly of the stiffness matrix and the Gaussian 
elimination are carried out in parallel. Moreover, it is not necessary to store 
the entire matrices A obtained through the elimination process in the fast 
memory, which may be difficult if M is large; instead it is sufficient at each 
step of the elimination to store just a smaller part of A) in the fast memory 
and communicate with a secondary memory only at the beginning and end 
of each step. 

Let us give some more details ot this procedure and to be specific let us 
consider the same situation as in Section 1.8. That is, let A-(ajj) be the 
stiffness matrix associated with the Neumann problem of Example 2.1 and 
the standard finite element space of piecewise linear functions on a triangu- 
lation Th={K} with basis {q 1, . . ., @a}. Suppose further that the nodes are 
enumerated so that A is a band matrix with band width d<M. The frontal 
method is based on the following facts: 


(i) The matrices A®, k=1,.. .. M, obtained through the Gaussian 
elimination, are all band matrices with band width d. To compute 
A(k+) with AC®) given, we need to change the elements aft by 
subtracting the quantities 


aK at) for i=kt1, ., ktd, jak kel... kt, 
tk) 
ayy 
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In other words, to compute A(®* it is sufficient to work with the 
(d+1)x(d+1) matrix 
k) (ky 
ayy AK kad 
Bu= : : 
: 4 tk) 
aga k- asd, ked 


occupying the following part of AM, 


LE 


(ii) In the assembly 
aj Zax(Gi. Qj). 


we add the contributions ax(@,, ¢;) from triangles K in which both 
node i and node j are vertices. Now, to eliminate the variable ey, 
ie, to take the step from AW) to A(k+), we only need to have the 
matrix elements in column and row k fully assembled, while the 
matrix element aij with i, }#k+1 may be modified ata later stage 
by adding the contributions ax(q,,qj) not yet included. 


From (i) and (ii) it follows that we may perform the assembly and elimination 
in parallel. In step k with A) given, we first assemble all remaining 
contributions from triangles K with node k as vertex, and then we compute 
A‘k+) in the usual way. In this case only the elements in Bx, the so-called 
active area, will be modified. At the end of step k we store row k of Al® (or 
A&+)), which will be row k of the upper triangular factor U in the 
factorization A=1.U, in a secondary memory and then move the active area 
one step in the south-east direction. 

The line dividing the triangles with fully assembled contributions and the 
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remaining triangles with not yet fully assembled contributions, is called the 
front. The assembly activity takes place at the front and with a suitable 
enumeration the front will sweep over the region Q in the combined 
assembly-elimination, We now consider an cxample. 


Example 6.3 Consider the following triangulation of the region Q: 


where the nodes have been numbered and the triangles are denoted by the 
letters a-i. The corresponding stiffness matrix has the following structure 
where x indicates non-zero elements. 


active x ee] 
area at K x | Xx 
step I x xX x4 x 
x xX xX xX xX X 
ent ea aie: 
x x x x x 
x x ox x 
x Xx x x 
x Xx x] 
Step 1. Assemble contributions from triangles with node 1 as vertex, ie, the 
triangles a and b. Eliminate node | (variable &,) and store row 1. 


Let x; denote the elements modified in Step 1. We have now obtained the 
following situation (note the fill-in: the element at location 23 is now non-zero 
corresponding to the fact that node 2 now is coupled to node 3 through the 
eliminated node 1). 
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active x xX xX =X 


area at xplxy xX, ky : 
step 2 My Xp Xp XY |x 
Xp} x, Xp xy xix KE 
ee Be x 
x Xx x x x 
x Xx xd & 
x Xx xX xX x 
Koo x 


Step 2. Assemble the remaining contribution from triangles with node 2 as 
a vertex, ie, the triangle c. Eliminate node 2, etc. 


6.6 Nested dissection 


In the nested dissection method one uses an enumeration of the nodes 
radically different from the ones we have used above. We illustrate the method 
in a simple example with the finite element method of the previous subsection 
on the following triangulation of the unit square Q: 


Step 1. We first view the structure or triangulation of Q subdivided into four 
substructures A-D as follows: 
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We then eliminate the inner nodes in each substructure, ic the nodes 
1to4, 


Step 2. We now combine A and B into one structure AB. and C and D into 
one structure DC: 


We then eliminate the inner nodes in AB and CD, and combine AB 
och CD into one structure ABCD: 


and climinate the inner nodes 7-9. 


Step 4. The nodes 10-25 are chminated. 
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Analogously, it is possible on more general triangulations to perform the 
elimination by successively creating larger and larger substructures and 
eliminating inner nodes. Suppose Q is the unit square with a uniform 
triangulation with step length h(P—1), where P=2?+1, p a natural number. 
with M=P? nodes. One can then show (cf Problem 6.2) that the nested 
dessection method requires 0(M?2) operations for LU-factorization of the 
corresponding stiffness matrix A. This should be compared with the 0(M?) 
Operations needed using the usual enumeration and storing A as a band matrix 
with band width M (cf Example 6.2). 

The reason that the nested dissection method is more efficient in this case. 
is the fact that it produces less fill-in. Kor general geometries, however, it may 
be rather difficult to implement the nested dissection method. 


Problems 


6.1 Show that the number of opcrations to factor a MXM matrix with band 
width d, is of the order Md2/2. 


6.2 Show that the operation count for the nested dissection method is 
0(M3?) in the example considered above. 
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7. Minimization algorithms. 
Iterative methods 


7.1 Introduction 


In this chapter we consider iterative methods tor the numerical solution of 
minimization problems of the form 


TA Min f(n), 

7.1) baat (m) 

where f: RM—> R is a quadratic function 
1 

(7.2) fim=5 1 An-b-n, 


with A a sparse symmetric positive definite MxM matrix and beRM. As we 
have seen above, application of the finite element method to a linear elliptic 
problem typically leads to a problem of the form (7.1). We know that (7.1) 
admits a unique solution §€ RM equivalently characterized by the equation 


(7.3) AE=b. 


Iterative methods for the solution of (7.1), or equivalently (7.3), play an 
increasingly important role in finite element applications. A key fact making 
iterative methods advantagcous is the extreme sparsity of the matrix A in 
standard applications. For a given type of finite element the number of 
non-zero entries in each row of A is bounded independently of the mesh size. 
This means that if only the non-zero entries of A are stored. then to compute 
An for a given ne R™ takes 0(M) operations (compared to 0(M?) if A is full). 
We emphasize that to achieve the operation count 0(M) we may not store A 
as eg a band matrix; only the non-zero entries of A should be stored, (cf 
Remark 7.3 below). 

We will consider iterative methods or minimization algorithms for (7.1) of 
the form: Given arvinitial approximation €°¢R™ of the exact solution &. find 
successive approximations &*eRM, k=1, 2, .. ., of the form 


(7.4) ERT <EkK+adk, k=0,1,.... 
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where dkeR™ is a search direction and ay>0 is a step length (note that the 
summation convention is not used in this chapter). Different methods differ 
in the choice of the search direction dé and step length ax. We will consider 
(a) the gradient method, and (b) the conjugate gradient method together with 
so-called preconditioned variants of these methods. 

We use the following notation. Given a smooth function g: RM —> R, denote 
by g' or Vg the gradient of g=g(n), ie, 

dg 8g ag } 


=Vg= % re 
. e am 3m om. 


Further, define the Hessian of g to be the MXM matrix g’=(g,)), ic, 


&g 
oni 
g’= 
ag 
Layman 
For the quadratic function f of (7.2), we have 
f"(n)=An-b. neRM, 
and 
f(Mm=A, neRM 


With Ek+! given by (7.4), we have by Taylor’s formula 
2 
+ a 7 

BCE 1) =a(EN) +cne' (Et) ahr dk a(n, 
where 1 lies on the line segment between E* and Ek* |. If the elements in g” 
are bounded in a neighborhood of &*, we thus have 

a(E1)=B(6) + ane'(ES) dk + 0(ag), as a 0. 
It follows that if 
(7.5) g’(E*) -d*<0, 


then g(E**!')<g(E*) if oy is sufficiently small. With this motivation we say that 
d* is a descent direction for g if (7.5) holds, because then g will decrease if 
we move a sinall distance from €* in the direction dé. In particular, (7.5) holds 
if we choose (see Fig 7.1) 


(7.6) dk=—g'(5t) 
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and if g'(EX)#0. In this case (7.4) corresponds to one step of the gradient 
meihod or the steepest descent method for the minimization problem 
min g(y). To choose the step-length a we may, for example, determine a, 
neR™ 


so that 
a(8k+a,d)= min g(&k+ad*), 
aad 
in which case ay is said to be optimal. To determine a, we perform a 
one-dimensional line-search to minimize g in the direction d* starting from 


&*, f ay is optimal, then 2 e(Ek+-ad*)=0 for a=ay so that (see Fig 7.2). 
ol 


7.7) g(ER*) d= 0. 


—e ih 


oH, 


Fig 7.1 
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In particular, if g is the function f given by (7.2), then by (7.7) 


O=1(Ek-+aydk) -dk=(A(EK-+ayd*)—b) - dk 
=(AEK—b) -d¥-+ondk- Adk, 


so that in this case ox is given by the following simple formula: 


(Agk—b) -d* 
7.8 & = ————_——_. 
(7.8) k dead 
Remark 7.1 Note that g’(é*) is orthogonal to a level curve for g through &* 
(a level curve for g is a curve y: [a, b] > RM such that g(y(t))=constant for 
te[a, b], see Fig 7.1). O 


We will be particularly interested in the rate of convergence of the different 
methods to be studied, ie, we will be interested in estimating how many steps 
or iterations of the form (7.4) will be needed to reduce the initial error &—§? 
by a certain factor. We will then see that the rate of convergence depends on 
the condition number (A) of A defined by: 


(7.9) (A) =o 


where 


Amax=max hy, Amm=min hy, 
j j 


and hj, j=1...., M, are the (positive) eigenvalues of A. We assume that the 
cigenvalues are ordered so that ky<h2 . . . Am, in which case of course 
Amin=A1 and Amax=Am- 

Example 7.1 Consider the special case of (7.1) with A the 2 x2 diagonal matrix 


Ay 0 
A= ; 
0 ho 


where 0<))<A2 and b=0, ie, we consider the problem 
(710) Mind (yn2+dan), 
neR?2 
with solution §=0. The level curves of f are in this case ellipses with half-axis 
proportional to V/A; and V/A. The sequence &°, &!,. . ., obtained by 


applying the gradient method with optimal step length to (7.10) is plotted in 
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F 
7A 
(ry) 


Fig 7.2 


Fig 7.2. We see that as the condition number becomes larger and the level 
curves more elongated, the sequence £9, &', . . ., has a more pronounced 
zig-zag and convergence becomes slower. O 


The above example shows that it is important to understand the behaviour 
of the condition number x(A). We will see that in a typical case when A results 
from application of the finite element method to a second order elliptic 
problem (such as eg the Poisson equation (1.16)), then 


(7.11) #(A)=0(h-?), 


where as usual h is the mesh parameter. For a problem of order four such as 
eg the biharmonic problem (2.22), one has x(A)=0(h~‘). To be more precise, 
these estimates hold if the finite element mesh is quasi-uniform, ie, all 
elements have roughly the same size (cf (7.44a) below), and if the usual 
minimum angle assumption (4.1) is valid. 

We finally conclude our preparations by recalling that 


(7.122) Apia min TAY, 


(7.12b) ana max 1 AN 


where |-| denotes the usual Euclidean norm 


M 2 
Inf=(2 03)". 
= 
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Further, defining the matrix norm 


{B| = max Bal ‘ 
neR™ | 
n#0 
for the MXM matrix B, we have by definition 
(7.13) |Bn|s/BI In| wneR™. 


IfB is symmetric with eigenvalues yy, . ., um, then we have (cf any basic course 
in linear algebra): 


(7.14) |B]=max |p). 
j 


7.2 The gradient method 


We will now study the rate of convergence of the gradient method for (7.1) 
with constant step length, ic, the method 


7.15) ektloekyadk, k=0,1, 

dk=-FG)=-(Ast—b), 
Here a is a suitably chosen (sufficiently small) positive constant. The 
appropriate size of a will become clear through the following analysis. Since 
the exact solution satisfies Ag=b, we have 
(7.16) §=§-a(AE-b), 


which after subtraction with (7.15) gives the following relation tor the error 


ek=E-EK: 


GAT) ektl=(-aAjek, k=0, 1, 
Thus, by (7.13) we have 
(7.18) Jet+!|<|T-aA| lek). 


We would now like to be able to guarantee that 
(7.19) |I-oAl=y<1. 


In this case the error would get reduced by the factor y at each step and the 
smaller y is, the more rapid is the convergence. Now, by (7.14) we have 
|I-aA!=max l1-aal, 
i 
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so that (7.19) holds if and only if 
1-ahj>-1, j=1,...,M, 


since the A, are positive. We thus conclude that a has to be chosen so that 
Ohmax<2. Choosing now a=1/Amax, Which is close to the best choice, we have 


{-aAle1— mia 21 - ay 
Nerian x(A) 


From (7.18) we thus have 
ek] <ylet| 
with Serr and by induction 


ek|<ykle%|, k=l... 


Let us now estimate the number of steps n required to reduce the initial error 
|e by a certain given factor €>0. That is. we seek the smallest n such that 


1 n 
(120) nis 


or equivalenty 


1 1 
—n log (1L- —~) 2 log -. 
g ( rT Ay) Be 
Using the easily proved fact that -log (L—x)>x for x<1, we see that (7.20) 
is satisfied if 


(7.21) n=K(A)log : 


We conclude that the required number of iterations in the gradient method 
(7.15), with a suitably chosen constant step «, is proportional to the condition 
number x(A) and the number of decimals in the error reduction factor e. In 
a typical FEM application involving a second order elliptic equation we have 
that x(A)=0(h~2) and in this case we would have n=0(h~2), ie, a very large 
number of iterations would have to be performed. 

By using an eigenvector expansion, it is possible to see more clearly why 
the gradient method is not efficient if x(A) is large. To this end, let wi... 
.. waeR™ be the orthonormal basis of eigenvectors corresponding to the 
eigenvalues 41, ..., AM of A. ie. 


AWV=AYy) j=). .M~. 
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Expanding the error e* in the basis wi... .. Wa. we have 


xe Yak 
e = Zeivi 


and the relation (7.17) takes the form 


(7.22) ckttaq~ jet j=l,....M, 

max 
with a=WVAmax. The relation (7.22) gives the error reduction for each 
component c* of the error e*, As the }j are ordered in increasing order with 
Amax=Am, we see from (7.22) that for j such that A2)ina,/2 (ie for “big” j), 
the corresponding component et gets reduced by at least a factor '/: at each 
step and a considerable reduction takes place. On the other hand, for “small 
J, Aj/Amax is much smaller than |, the error reduction factor (1- 4j/Amax) is close 
lo one and the reduction is small. Thus, error components ct for large j are 
reduced quickly, while components et for j small are only slowly reduced 
Another way of saying this is that highly oscillatory components of the error 
are quickly reduced while more slowly varying components only get slowly 
reduced. This is because the eigenvectors yj; for large j are rapidly oscillating 
and for small j the 1; vary “more smoothly’, cf Problem 8.1. 

To sum up, we may say that the gradient method efficiently reduces highly 
oscillatory components of the error while the smooth compunents only 
become small very slowly, and thus as a whole the gradient method is 
inefficient (cf Section 7.5 below on multi-grid methods where the gradient 
method is put to very efficient use.) 


Remark 7.2 The gradient method for (7.1) with the optimal step length of 
(7.8) is given by: 


(7.23a) EK! 1S BkK—ay(AEK-b), 


rhe gk 


(7.23b) Sarre 


The coavergence properties of this method are similar to those of the gradient 
method with constant steps just studied and in particular the required number 
of iterations is proportional to x(A), ¢f Problem 7.2. 0 
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7.3 The conjugate gradient method 


We will now describe a more efficient iterative method for (7.1) of the torm 
(7.4), namely the conjugare gradient method. In this method the step length 
&& iS chosen to be optimal and the search directions dk are conjugate, ie, 


(7.24) di-Adi=0. ij. 


Since A is positive definite, we may define a scalar product <. . .> on RM 
by 


<t.yeaC-An, GC eR™, 

and (7.24) can then be written 
<d, di>=0, ixj. 

The norm ||- ||4 corresponding to <. , .> is the energy norm: 
IInlla=<n. n>'?, neR™. 


The conjugate gradient method can now be stated as follows: Given B0ERM 
and d°=—r°, find s* and d*, k=1, 2,. . ., such that 


(7.25a) Ekticek+aydk 


kK. gk 
(7.25b) y= er red 
(7.250) dk le +14 Bak, 
(7.258) y= SE 
<dk, dk> 
where 


re=f'(&k)=Agk—p, 


If we compare with (7.8) we see that (7.25b) means that Ox is optimal. We 
note that the new search direction d*+! js a linear combination of the new 
gradient r&*! and the old search direction d*. Further, in view of (7.25c}, the 
condition <dk*!, dk>=0 is equivalent 10 


<—r14B.uk, dk =0, 
which is the same as (7.25d). We thus see directly that the new search direction 


d** 1 is conjugate with respect to the old direction dk, We now take anessential 
step in the analysis of the method and prove that dk! is also conjugate with 
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respect to all other previous search directions d), j=0,.. ., k (cf. [Lu]). We 
wilt need the following lomma. where we use the notation 


m r 
[n?,.. ant={neRM: n= & ain), ajeR) 
iz 
=lincar space spanned by nieR™M, j=0,..., m. 


Lemma 7.1 For m=0, 1... ., we have [d°... .. d™J=[r9,.. ., r™] 
=[r9, Ar, ..., AT]. 


Proof We use an induction argument. The stated equality clearly holds for 
m=0. Suppose now that the equality holds for m=k. We first observe that 
after multiplication by A, (7.25a) gives 


(7.26) rtlapktqy Ads, 


By the induction assumption, we have dke[r?, Ar®, —., A*r’] so that Adke 
[r°, Ar®, . . .. A&*!r°] which shows that 
(7.27) (ro... eof, Aro... ARF rH. 


On the other hand, according to the induction hypothesis, we have A*r°e[d?, 
.. 2. dS} so that Akt r9e[Ad®, . . ., Ad*) which together with (7.26) shows 


that AF ef, |... rt!) Thus we have [19 Ar’... ART cfr, 
rk+1] which by (7.27) shows that [19, Ar, ..., A&*120]=([r9,..., rk*!]. Finally. 
from (7.25e) we clearly have that [r°,.. ., F=[d0,. . dé!) and the 


induction step is thereby complete. O 


We can now prove 


Lemma 7.2 ‘The search directions d! are pairwise conjugate, ic, 

(7.28) <di, di>=0, iAj- 

Further, the gradients r! are orthogonal, iv, 

(7.29) ri-ri=0, iFj. 

Proof Suppose the statement is true for i, j<k. Since [des ealfe [es = 
1], by Lemma 7.1 we have in particular &-di=0 for j=0, — ., k~1, so using 


(7.26) 
k+l kak. ditay<dk,d>=0.  j=0,...,k-1. 


But c, is optimal and we also have 
rk#l. dk=f!(BE-+agdk) d= Zit + ad y2u,=0, 
1h 
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sork+!.di=0,j=0,...,k. Together with Lemma 7.1 this shows thatr’*!- =0 
for j=0. —., k, which proves (7.29) for i, jSk+1. Finally, to show (7.28) for 
i, jk+1 we note that since Adle[r®, . . ., 1} ‘| by (7.26), we have by (7.29) 
that <rkt!, di>=0, j=0,.. ., k- 1. Together with (7.25c) and the induction 
hypothesis this proves that 

<dktl dima], di>+iy<dk, di>=0, j=0,....k-1. 


But we already know that <dk*!, dé>=0 and thus we have proved (7.28) for 
i, j€k+1. The induction step is now complete and the lemma follows since 
the statement is clearly true for i, jst. O 


We can now prove that the conjugate gradient method gives the exact 
solution after at most M steps: 


Theorem 7.1 For some mM, we have AE™=b. 


Proof By (7.29) the gradients 1, j=0, 1, . . ., are pairwise orthogonal and 
since there are in RM at most M pairwise orthogonal non-zero vectors, it 
follows that r"=AE™—b=0 for some mSM. O 


By Theorem 7.1 the conjugate gradient method gives, in the absence of 
round-off errors, the exact solution after at most M steps. In our applications, 
however, we will view the conjugate gradicnt method as an iterative method 
and the required number of iterations will be much smaller than M. To study 
the convergence properties of the method, we first note thal by (7.25a) we 
have for k=0, 1, . .., 


ko}. 
(7.30) gk_g0— ieee 


By the orthogonality (7.28) it follows that 

<Ek, dk>=<89, dk>, k=0, 1.... 
Using also the fact that AE=b, we see that for k=0, 1,..., 

—rk. dk=—(Aek— Ag). dk=<g—ek, dh> =<t—£0, dk>, 
which shows that (7.25b) can be written 
_ <B-§0, dk> 


OK= k=0, 1, . 
: <dk, dk> 
Thus, by (7.30) we have in particular 
<ek—E0 dip =<E-20 di> = j=0.1. K-12. 
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But this is the same as saying that &‘—E" is the projection of the initial error 
E—€° with respect to <. , .> on the space 


Wy=[d9, . .., de-}] 
spanned by the first k search directions, and thus 
(7.31) E> EX|l4= [PE- E°- (SK-EMlaSlIE-S°-nlla.  Vne We. 
Recalling Lemma 7.1 and the fact that r=At_ AE=—A(§—E9), we see that 
Wiel! AM, AR IM=[AG—89,. AMEE). 


Using (7.31) we thus have the following result: 


Theorem 7.2 For the conjugate gradient method (7.25), 
IIB-EM|4|IPR(A)(E-E2)||asmax ipy(A,)| [IE~Eolla,  WPREP A. 
j 


: k 
where Py is the set of polynomials px(z)= = Byz!, ByeR. of degree at most k 
with Bg=1. am 
‘Lo estimate the reduction of the initial error |/E~€9||. after k steps, it is 
by Theorem 7.2 sufficient to construct a polynomial p, of degree at most k 


such that px(0)=1 and px is as small as possible on the interval [A1, Am] 
containing the eigenvalues of A, ie, so that the quantity 


sae ane (px(z)| 


Ze[A, Aw 


is as small as possible. The best polynomial is a Chebyshev polynomial well 
known in approximation theory, and the corresponding value of y, is (see eg 


[Ax]): 


_ k 
n=2| Va (A - . k=O, 1,2 


Vk(A)+1 ioe 
Thus, for a given e>0, to satisfy 

5-4 sell5—84| 
it is sufficient to choose n such that y,%, or by a simple computation, such 
that 


(7.32) on i Va(A) log 2. 


Ay 


We thus conclude that the required number of iterations for the conjugate 
gradient method is proportional to Vx(A) which should be compared with 
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x(A) in the case of the gradient method. ‘I'hus, for x(A) large, the conjugate 
gradient method is much more efficient than the gradient method. In atypical 
finite element application we have x(A)=0(h~2), and thus in this case the 
required number of iterations would be of the order 0(h~!) for the conjugate 
gradient method and 0(h~*) for the gradient method. 


Remark The subspace Wy=[r°, Ar’, ..., Ak~'r]=[d®, . .., dé") is called 
the Krylov subspace rclated to the conjugate gradient method (7.25). By 
(7.31) we have that ||E-&*|!4 is the norm of the difference between the initial 
error &—€9 and ils projection on Wy. 0 


Example 7.2 Let us recall the simple minimization problem of Example 7.1, 
(7.33) Min Lam?+2on3). 
neR? 2 


with O<A)<<A2. Applying the gradient method with optimal or constant step 
length, we have that the required number of iterations is proportional to 
doh}. Introducing the new variable C=(t1, &2)=(Wain2. Wa2ne). the prob- 
lem takes the form 


(7.38) Min 306548). 


The condition number of the corresponding matrix is equal to 1, and the 
gradient method with optimal step length for (7.34) finds the exact solution 
C=0 in just one iteration. This shows that a suitable change of variables may 
reduce the number of iterations significantly. We see that the very elongated 
elliptical level curves ot (7.33) are replaced by the circular level curves of 
(7.34). The possibility of reducing the condition number for more generat 
problems by a suitable change of variables corresponding to so-called 
preconditioning, will be discussed in Section 7.4 below. O 


Problems 
7.1 Show that Bx of (7.25d) can alternatively be computed as follows: 
kth pet 
AT peak 


Dea Prove for the gradient method with optimal step length (7.8) that 


lee t<c1- 5) EWES 


by proving 
SRA ie ook rk 


(ERA oh ark oko a Tek 135 


7.4 Preconditioning 


We recall our quadratic minimization problem (7.1) 
2 z 1 

7.35 Min f(n)= Min[ <n: An—b- 9]. 

(7.35) hart (n) Min[ 59 nbn] 


Let now E be a non-singular MXM matrix and introduce the new variable 
¢=En so that n=E7!&, and define 


(=H) =18“'Ey=4 (E7!t)- A(E~!t)-b- E“!t= 


= SOE TAB HE Th-t= 3. At-b-, 
where 
A=E-TAE™!, b=E-Tb, 
and E~T=(E~!)T, where DT denotes the transpose of the matrix D. Thus we 
can write the problem (7.35) using the new variable ¢ as 


(7.36) ~~ Min(LeAc-b- yy. 

teRM 2 
The gradient method with constant steps a for this problem reads: 
(7.37) &R+L=k-a(Ak—b). 
The rate of convergence of this method depends on the condition number 
x(A). If x(A)<<x(A), then the gradient method for (7.36) will converge 
much faster than the same method applied to the original problem (7.35). 

Before discussing how to choose the matrix E note that setting C=En and 

multiplying by E-! in (7.37), we get 

nktt=nk-uEE-T(Ank-b). 
Thus, setting C=LTE so that C-'=E~'E~T, we see that (7.37) corresponds 
to the following method for (7.35): 
(7.38) nkt!=n*-aC-(An*—b), k=0, 1,2. , - 
We say that this is a preconditioned version of the usual gradient method for 
(7.35) with the matrix C being the preconditioner. To compute y**! from 
(7.38) for a given nk, we have to solve the system 

Cd=(Ank—b), 
(note that we would not explicitly form C~!), 

We can now state the obviously desired properties of the matrix C=ETE 

(recall that A=E~TAE-!): 
(7.39a) X(E"TAE“)<ccx(A), 
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(7.39b) the system Cd=e can be solved with few (0(M)) operations for 
a given right hand side e 

Suppose that C=ETE is the Cholesky factorization of C, and hence E is upper 
triangular. Then (7.39b) will be satisfied if E is essentially as sparse as A, ie, 
if the number of non-zero entries in each row of E is bounded independently 
of h. On the other hand, to satisfy (7.39a) the best choice would be C= A=ETE 
with ETE the Cholesky factorization of A. in which case x(E~ TAF~!)=1. 
However, with this choice the matrix E is not as sparse as desired (cf the 
discussion of fill-in Section 6.4), and (7.39b) will be violated. With this 
background we are led to try to construct C=E!E such that E is sparse and 
E'E is an approximate Cholesky factorization of A. We may require E to have 
a Sparsity structure that is similar to that of A; for example we might allow 
an element ej of E to be non-zero only if the corresponding element aij of 
A is non-zero. To obtain an approximate factorization ETE of A with this 
structure, we may perform a modified Gaussian elimination where non-zero 
elements appearing in the elimination process at “forbidden™ locations are 
simply replaced by zeros. Such modified elimination processes (so-called 
incomplete factorizations) only take O(M) operations and result in approxi- 
mate factorizations with corresponding considerable reduction of the condi- 
tion number, (eg, x(E~! AE~!)=0(h7!), see [Ax], [Me]). 


7.5 Multigrid methods 


Recently a class of methods tor our typical system of equations (7.3) have been 
developed that arc optimal in the sense that the required number of operations 
is of the order 0(M), where M is the number of unknowns (clearly 0(M) is 
optimal since this amount of work is required just 10 write down the solution). 
These methods are the so-called mudti-grid methods (see eg [BD], [Bra], 
[Hac]). A multi-grid method is an iterative method where one uses a collection 
of successively coarser finite element grids. 

To give an idea of the basic features of the multi-grid method we consider 
the standard finite element method of Section 1.4 on a triangulation Ty 
obtained by subdividing each triangle of a coarser triangulation T2) into four 
triangles as in Fig 1.14. Let the correspongding finite element spaces be Vj, 
and Van. Then the corresponding matrices Ay and Aap have dimension MXM 
and (M/4)x(M/4), respectively. Assume that we want to solve the system 
An&€=b, and to start with assume that the system Aayn=d tor a given d can 
be solved in 0(M/4) operations. A step of the multigrid method leading from 
a given approximation E*eR™ to an improved approximation Ek*!eR™ now 
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consists of two substeps: a smoothing step and a coarse grid correction. The 
smoothing step consists of m usual gradient steps: 

(7.40) nit!=ni-a(Ann'-b), i=0.1, ., m-1, 

with a suitably chosen (cf Section 7.2), and n°=&*. This step gives the 


1 
k+e 
approximation & *=n™. The coarse grid correction is obtained as follows: 


Let 5€V 2p be the solution of the problem 


‘ ke} 
(7.41) a(5, v)=(f, v)-a(u 2.) Wve Von, 
rea ra 
where u 7eVp is the finile clement function with nodal values ¢ 2 
k+i OM 
Let 5 2eR?@ be the vector of nodal values of 8 and define 


1 k+l 


ekg 245 2, 
krd k+J M 
where the components of 8 7¢R™ are given bythe componentsof3 7eR4 


1 

for the nodes of Tap, and the value of a? at other nodes are obained by 
linear interpolation from the valucs at the nodes of T2y. We note that the 
correction step (7.41) corresponds to a problem of the form Azpn=d which 
can be solved in 0(M/4) operations by assumption. 

To sum up, a multigrid step leading from &* to &**! consists of a simple 
smoothing step together with a coarse grid correction requiring few oper- 
ations. Under suitable assumptions une can prove that there is a constant C 
independent of § and k such that 


g—ek+!]<€je—ek, 
m 


which proves that for m sufficiently large cach multigrid step reduces the error 
significantly. 

The algorithm is now applied recursively so that to solve a problem of the 
form Ajyn=d in the step described above, we invoke a coarser grid with 
corresponding matrix A4,, assuming that Tz, is obtained as above by 
refinement of the corser grid Typ. This gives a procedure where we work on 
a sequence Th, Tan. Tan, Tgh. . . .. of successively coarser grids ending with 
a coarsest grid for which the corresponding linear system can be solved by 
direct Gauss elimination with few operations. One can show that this 
combined process will give a solution of the original matrix problem A,§=b 
in 0(M) operations. 

The reason why the multigrid method is so efficient is, roughly speaking, 
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the following: In the smoothing step the high frequency components of the 
crror (corresponding to large eigenvalues) are significantly reduced. This fact 
is easy to understand from the analysis of the gradient method in Section 7.2 
above. Further, in the coarse grid correction the low and medium frequency 
components of the error are also significantly reduced and thus in cach 
multigrid step all components of the error are reduced significantly 


7.6 Work estimates for direct and iterative 
methods 


Here we collect the principal results presented above concerning the amount 
of work required to solve our typical system of equations 


(7.42) AE=b 


by direct and iterative methods, where A is a sparse, symmetric and positive 
definite MxM matrix. We then suppose that (7.42) is related to a second order 
elliptic problem in R¢, d=2 or 3. In this case M=0(h7“) and the condition 
number x(A)=0(h~2). We fucther assume that in the preconditioned variants 
of the conjugate gradient method the condition number is reduced to O(h7!). 
Also, in the Cholesky factorization we assume that A is stored as a band matrix 
with band width O(h-4*!), With these assumptions we have an asymptotic 
work estimate for the solution of (7.42) of the form 0(M*), where the 
exponents a are given by: 


Method dim 2 dim 3 
Band-Cholesky: tactonzation = 2 2.33 
back-substitution 16 1.67 
Nested dissection: tactorizaton LS 2 
back-substitution i} 1.33 
Conjugate gradient 15 1.33 
Preconditioned conjugate gradient 1.25 1A7 
Mullignd 1 1 
Fig 73 


Clearly, the multigrid and preconditioned conjugate gradient method have 
the most favourable exponents and for M large enough will be superior to 
band-Cholesky and nested dissection. This holds particularly for d=3. 
However, the multigrid method requires a rather complex program with 
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considerable overhead to organize the computations while band-Cholesky 
requires little overhead. Thus, fora given M itis not clear which method would 
require the least total cost, and of course this cost also depends on the problem 
and on the implementation of the particular method. 

We also note that one sometimes wants to solve the system AE=b many 
times with the same A but different right hand sides b. For example we may 
want to compute the stress distribution in an elastic body under various loads. 
In this case we may factorize the matrix A once and for all and then only a 
back-substitution will be required for each new right hand side. In such cases 
band-Cholesky becomes comparatively more competitive but still js asymp- 
totically inferior to the preconditioned conjugate gradient and multigrid 
methods. 

To sum up we may say that, roughly speaking, band-Cholesky may be used 
for coarse to medium fine discretizations in two dimensions whereas iterative 
methods multigrid or preconditioned conjugate gradient type would be 
advantageous for large three-dimensional problems and for very fine discre- 
tizations in two dimentions. Let us remark that these conclusions should be 
valid at least on well-structured problems with coefficients that are not varying 
too much and using eg quasi-uniform finite element meshes. For problems 
with highly variable coefficients and very complicated solutions it may be 
difficult to find iterative methods with good convergence properties and in 
such cases Gaussian elimination may be the only realistic alternative at 
present. 


Remark 7.3 To store only the nonzero elements of a sparse symmetric MXM 
matrix A=(aj)), one may use a vector a=(a(i)) containing the elements in the 
lower triangular part of A ordered row by row, together with a vector 
ac=(ac(i)). with ac(i) the number of the column in A containing the element 
a(i), and the vector ad=(ad(j}), with ad(j)=i where a(i)=a;j. As an example. 
if 


an 
al2  a22 sym 
A=]0 — a3q_ aga F 


an 9 Aas Aa 
as: 0 asx 0 ass 
then we have 
a=(ar1. 212, a22, a32, 433, Asi, O43. Aas. AS), ASS. ASS), 
ac=(t, t, 2,2, 3, 1.3.4, 1,3, 5). 
ad=(1, 3, 5, 8, IN). a 
Problem 


73 Determine the asymptotic work estimates corresponding to Fig 7.3 for 
a fourth order elliptic problem. 


7.7 The condition number of the stiffness matrix 


If A is the stiffness matrix related to an elliptic problem of order 2m, then 
the condition number «(A) is under suitable conditions estimated by 


(7.43) #(A)=0(h~2"), 
Let us prove this result in the standard case m=!. A=(ajj), a, =a( Gi. Fj). 
a(v, w=] Ve. Vw dx, 
2 
with @y,... @M. the usual basis for Vy={veHg(Q): vikePi(K). KeTa). 
where Qc R?. This is the case studied in Section 1.4. 
We shall assume that the family {Ty} of triangulations Ty={K} satisfies the 


following conditions: There are positive constant By and Bz independent 


of h=max hx such that for all KeTy, The{ Th}. 
KeT, 


(7.44a) hy 2 Pyh. 


(7.44b) 2K 2 Bp, 
hk 


where hy and 0x are defined as in Section 4.2. The condition (7.44a) states 
that all elements K of Tp are of roughly the same size. Such triangulations 
are said to be quasi-uniform. 

We recall that the bilinear form af. . .) 1s HQ) — elliptic, ie, there is a 
positive constant @ such that 


(7.45) aly, v) 2a) vila) WveH, (2). 


The estimate (7.43) with m=1 will easily follow from the following result: 


Lemma 7.3 There are constants ¢ and C only depending on the constants 6; 


M 
in (7.44), such that torall Yee MGiE Vn 
i= 
(7.46) Chen)? |Iy|/2<ChIMP, 
(7.47) a(v, v)= f|Vvi2dx = Ch74|!y|]?. 
2 
where ||vij=||v[l1.(2) 


Remark The estimate (7.47) is a so-called inverse estimate; here we estimate 
the Ls-norm of the gradient of v in terms of the Lz-norm of v itself. This is 
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not possible fur a general function v, but it is possible for the functions v in 
Vj}, at the price of the factor h7!. O 


We postpone the proof of Lemma 7.3 and show how to prove (7.43) using 


M 
the lemma. We recallthat ifv= > nip, then 
i=} 


a(v, v)=n- An. 
so that by (7.46) and (7.47) 
f eles 
cag) EAN A) ccy-? MP ce WneR™, 
Ini? In? nl 


2Cuh? WneRM. 


‘Together. (7.48) and (7.49) prove that there are constants c and C such that 


AmaxSC.  Amin2ch?, 


which gives the desired result (A) = Ch-2, 
min 


Remark 7.4 Note that it is natural to scale the matrix A. by multiplying with 
a constant of order 0(h~2). so that Amaxy=O(h7?) and Amin=O(1) (cf (1.25). 
With this scaling A will be a discrete counterpart of the Laplace operator with 
eigenvalues ranging from O(1) to O(h-2) (recall that the eigenvalues ot the 
Laplace operator on a bounded domain lie in the unbounded interval (A, %) 
tor some positive A. O 


Let us give 


Proof of Lemma 7.3 I is sufficient to show that for each triangle Ke Tp with 
vertices a! and veP\(K), we have 


3 F 3 
(7.50) Chz Za? s livia Chk = Iv(a)|?, 
i= iz 
(7.51) SIV vPdx < Chg? flvPdx, 
K K 


with ¢ and C independent of K and v. From these estimates the desired 
estimates (7.46) and (7.47) directly follow by summation over KeTh. 
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We first show that (7.50) and (7.51) hold when K=K where K is the reference 
triangle with vertices at (0, 0), (1, 0) and (0, 1) in a (2), %2) - plane (see Fig 
7.3). Let Ki be the usual basis functions of P;(K) and define 


fy(A)— JI Vea, 
K 

fr{i)= {6a 
K 


where §=(fi, fz, 3) and 


We observe that fy and f) are continuous functions of ffeR?. We now consider 
the quotient 


i(fy= 2, er? 40. 
f(A) 

We want to prove that there is a constant C such that 
(7.52) 1(A)SC, feR®.  Hx#0; 
this inequality clearly corresponds to (7.51) in the case K=K since hk=V 2 
To prove (7.52) we first note (hal 

f(y=h(h). VyeR. y#0, 
ie, the function ts is homogeneous of degree zero. It is thus sufficient to prove 
that for some constant C 
(7.53) h(q)<C eB, 


where B={feR*: |h]=1}. But fs is continuous on B (note in particular that 
f:(4)#0 for eB) and B is a closed and bounded set in R}, and thus f; has 
a maximum on B. This proves (7.53) and thus (7.52) and (7.51) follow in the 
case K=R. In the same way we can prove (7.50) in this case. 

It now remains to prove (7.50) and (7.51) for an arbitrary triangle Ke Th. 
For simplicity assume that K is a triangle with vertices at (0, 0), (h. 0) and 
(0, h) so that hk~ V2h and let the mapping F: KK be defined by (see 
Fig 7.3) 


x=P(R)=(hty, hd2), £ER. 


x 
(h,0) i 
Fig 7.3 
Given veP\(K) we now define 
(7.54) WR)=V(x)=V(F(R)). KEK. 
Clearly we then have ve P,(K). By the chain rule 
av av ax, | Ov Ox2 _ av im12 


Bk Ox, BRO, OR OX; 

and so Vo=hVv. Since dx=hdk, this gives 
J) Vvedx= Sh-2| V oPdx= J] 9 e]2dk 
K K Pa 


<Cfidg=CJvh-2dx=Ch-2fv2dx, 
Rg kK K 

where we used that (7.51) holds if K=K. This proves (7.51) if K has vertices 
at (0.0), (h, 0), (0. .h), and in a similar way (7.50) can be shown in this case. 

Finally, if K is an arbitrary triangle, then we introduce the linear mapping 
F: KK that maps K onto K: 

x=F(3)=a!+(a?-a!)h1+ (aa! )x?, 

where ai are the vertices of K. Arguing now as in the above special case and 
using the tacts that Jai—a'l<Chx, i=2, 3, and dx=Chzdk by (7.44). we now 
obtain (7.50) and (7.51) in the general case (cf Problem 7.4) and the proof 
is complete. 


Remark The technique of working with a reference element K and linear 
mappings F: K- K that map K onto the triangles Kel), is very important 
also from a practical point of view. In this way it is often possible to use a simple 
transformation to obtain the stifiness matrix of a general element K from the 


144 


stiffness matrix of the reference element K. This is important since the direct 
eee ofa stiffness matrix may require a not-negligible amount of work 
and the element stiffness matrices of all 

: F s s elements KeT; have to be 
determined. O 


Problems 
7.4 Complete the proof of Lemma 7.3. 


7.5 The condition (7.44a) Stating that all elements have the same size can 
be relaxed. Prove without using (7.44a) that the estimate «(A)=0(h-2) 
may be replaced by 

x(A)=0(h5 2) where Duin= min hr, 
KeT, 


7.6 Let un€ Vac L2(Q) be the L2(S)-projection of ueL(Q il 
defi 
Problem 4.8), ee al 


(un. v)=(u, v) We Vy, 


In matrix form with the basis {P1,.... m} for Vj). this problem takes 
the form B§~=b, where B= (by), by=(qi, Gj), D=(bj), bj=(u. gp). Prove 
that B is positive definite. When Vp is piecewise polynomial on a 
quasi-uniform triangulation in R?, show that x(B)=0(1). The matrix 
Bis called the »»ass matrix and will occur in Chapters 8 and 9. 
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8. FEM for parabolic problems 


8.1 Introduction 


In this chapter we give an introduction to finite element methods for linear 
parabolic problems. A typical such problem. modelling heat conduction inan 
isotropic body with heat capacity 4 and conductivity p and occupying @ region 
2cR4, reads as follows: 


ya-div(uVu)=et in QXx1, 
(8.1) u=0 on I) x). 
du on Tx, 
"on 
u(x, 0)=u"%x) xeQ. 


Here u(x. t) is the temperature at xe at time tel=(0. T). where is a given 
time, u® is a given initial temperature, fis a given heat production, Y and 
T2 is a subdivision of the boundary T of Q and u=sw69t. For simplicity we 
shall consider the following special case of (8.1) with A=p=1, QcR? and 


(8.2a) u—Au=f in QxI, 
(8.2b) u=0 on Px, 
(820) uf, Ou". 


Essential parts of the presentation based on (8.2) that follows, may directly 
be extended to the more general problem (8.1), (cf Example 2.7) 

We will first consider a so-called semi-ciscrete analogue of (8.2) where we 
have discretized in space using the finite element method. To obtain a tully 
discrete problem we will then discretize time also. We shall see that the 
semi-discrete problem is an initial value problem for a system of ordinary 
differential equations. This will be a stiff system which will pose extra 
requirements on the stability of the methods to be used for the time-discre- 
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tization (the notion of an initial value problem for a stiff system of ordinary 
differential equations will be explained in Section 8.3 below). For the 
time-discretization we shall first consider two classical methods foi stiff 
problems: the backward Euler method and the Crank-Nirolson method. We 
shall then consider a recent method. the so-called discontinuous Galerkin 
method, based on using a finite element formulation in time with piecewise 
polynomials of degree q. In simple cases (eg A=p=1 and f=0 in (8.1)) the 
discontinuous Galerkin method gives time-discretization methods which 
coincide with classical methods based on the so-called sub-diagonal Padé 
approximations. In particular, with q=0 one obtains the backward Euler 
method. he advantage of the discontinuous Galerkin method is that eg 
variable coefficients and non-zero right hand sides (and even non-linearities) 
present no complications in principle. Further, the fact that the method has 
a variational formulation is very useful in the analysis of the time-discretization 
errur. In fact, one can derive precise error estimates for the discontinuous 
Galerkin method which make it possible to construct (for the first time) 
rational efficient methods for automatic time-step control which are of 
particular importance for stiff problems. We comment briefly on this topic 
in Section 8,4. The new possibilities offered by the discontinuous Galerkin 
method have been discovered only recently and are still under exploration, 
see [EJT], [J3]. [EJ1]. [JNT]. [EJL]. For morc information on finite element 
methods for parabolic problems. see also [Th]. [LR] and the references 
therein. 


8.2 A one-dimensional model problem 


Before going into the discussion of the numcrical methods for (8.2) we shall 
briefly indicate some of the main properties of the exact solution u of (8.2). 
For simplicity we will then consider the following one-dimensional model 
problem modelling heat conduction in a bar (cf (1.3)): 


du Ou 
8.3: —- =f O<x<a. >, 
CN oe ae — 
(8.3b) u(0,0)=u(a,t) =0 1>0, 
(8.3¢) u(x,0)=u"(x) O<x-<m. 


In the case [=0, we have by scparation of variables that the solution of (8.3) 
is given by 


(8.4) u(x.t)= 5 ule “I sin(tx), 
vl 


147 


where 


7 
uf'= V2hn f u@dsin Gx)dx, [Rees ay 
0 


are the Fourier coefticients of the initial data u° with respect to the ortho- 
normal system (V2isinUix)}? in T.2(0,1). By (8.4) we see that u(x.t) is a 
linear combination of sine waves sin (jx) with frequencies j and amplitudes 
uP exp(—j?t). We may say that each component sin (jx) lives on a lime scale 
of order 0(j-*) since exp(—j*t) is very small for j?t moderately large. In 
particular we have that high frequency components quickly vet damped. Thus, 
the solution u(x.t) will become smoother as t increases. This of course fits with 
the intuitive idea of the nature of a diffusion process such as heat conduction. 
However, in general u(x.t) will not be smooth for t small, and we may have 
that |{u(t)[]=||u(- vj] -> © as t—> 0, where |j- |) denotes the L2(0, 2)-norm 
More precisely, the size of the derivates of u (with respect to t or x) tor t small 


will depend on how quickly the Fourier coefficients u? decay with inercasing 


j: For example, if u(x) =-x for 0<x<n, then w= Ci, in which case 
[Ju(t)}]~Ct-® with a= 3/4 as t->0, and if u%x) is the “har function” 
u9%(x) = min(x.t—x) for 0<x <a, then up = C/j? in which case |iu(t)]]~Ct-# 
with a= 1/4 as t 0 (cf Problem 8.1). If up decays taster than 7-2 as j &, 
then |/u(t)|} will be bounded as (0, but higher derivatives may still be 
unbounded. In principle, the “smoother” the initial function u® is, the more 
rapidly uf decays as j-> ©. Note that here a “smooth” initial function has to 
satisfy in particular the boundary conditions (8.3b). 

An initial phase for t small where certain derivatives of are large. is called 
an initial transient. Thus the exact solution of a parabolic problem in general 
will have an initial transient where certain derivatives are large. but the 
solution will become smoother as t increases. [his fact 1s of importance when 
solving a parabolic problem numcrically, since it is advantageous to vary the 
mesh size (in time and space) according to the smoothness of the exact solution 
u and thus use a fine mesh where u is non-smooth and increase the mesh size 
as u becomes smoother. Note that transients may also occur for t>0 if for 
example the right hand side f (or the boundary conditions) i (8.1)-(8.3) vary 
abruptly in time. 

The basic stability estimates in our context for the problems (8.2) and (8.3) 
are in the case f=0: 


(8.5) {juil<flu"ll, tel. 
(86) [la(yl<Sho%l, tel, 
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For the problem (8.3) these estimates follow directly from (8.4) using 
Parseval’s formula together with the facts that OSe75<1 and O<se °<C for 
820. It is also possible to prove (8.5) and (8.6) using “energy methods” 
without relying on an explicit solution based on separation of variables (cf 
Problem 8.6 below). Note that (8.6) states that if uel o(Q), then 
[Jay |l=0(t-4) as t 0. 

Let us now turn to the discussion of numerical methods for (8.2). 


8.3 Semi-discretization in space 


The semi-discrete analogue of (8.2) will be based on a variational formulation 
Of (8.2) which we now describe. Letting V=HiK(Q). multiplying (8.2a) for a 
given t by ve V, integrating over Q and using in the usual way Green’s formula. 
we get with the notation of Section 1.4: 


(u(),v) +a(u(t).v) =(1@),¥). 
Thus, we are led to the following variational formulation of (8.2): Find 
u(QeV, Ic], such that 
(8.7a) (0(t).v) 4a(u(t),v)=(f(t).¥) VveV, tel, 
(8.7b) u(0)=u", 

Now. let Vp be a finite-dimensional subspace of V with basis {@}, . .  @u}- 
For definiteness we shall assume that Q is a polygonal convex domain and 
that V, consists of piecewise linear functions on a quasi-uniform triangulation 
of Q with mesh size h and satisfying the minimum angle condition (4.1). 


Replacing VY by the finite-dimensional subspace V, we get the following 
semi-discrete analogue of (8.7): Find un(Qe Vy. tel, such that 


(8.8a) (an(t),v) +a(un(t),v)=(£(t).v) VveV), tel, 
(8.8b) (un(0),v)=(u"v) VWveVh. 

Let us rewrite (8.8) using the representation 

9) axltay= 2 SDC, tel. 


with the time-dependent coefficients E,(tleR. Using (8.9) and taking V=Qj, 
J=l, +. M, in (8.8), we get 
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Mi. M _ 
2 EO (Gi HF E SOAGiV =), jel. o..M, tel. 
t=! . 


M F 
E 8 (0) (91.4) =F) He Lp iM 


or in matrix form 

(8.1Ua)- BE(t)FAE(t)= F(t). tel, 

(8.10b) —- BE(0)=U®, 

where B=(bj). A=(a,). P= (FR). $=(G), U=(U)). 


by=(qi.0)) = Syiqydx. 
Q 
ajj=a(i-,) ={V9 Vg dx. 


F(Y=().0)), UL=(u"). 
Recall that both the mass matrix B and the stiffness matrix A are symmetric 
and positive definite. Further x(B)=0(1) and x(A)=0(h7*) as h-»0 (sec 
Problem 7.6). Introducing the Cholesky decomposition B=ETE and the new 
variable 1= EE, the problem (8.10) takes the slightly simpler form 


AH+An(H=e(t), tel. 

n(0)=n°. 

where A=E-TAE-! is a pasitive definite symmetric matrix with 
#(A)=0(h~2), g=E TF, n°=E-TU® and E T=(E7!)T=(E!)7!. The solution 
of (8.11) is given by the following formula (see any book on ordinary 
differential equations): 


(8.11) 


si fing ahs 
(8.12) a(tyre Atyl+ [e-Alt-sg(syds, tel. 
a 


The problem (8.11) (and (8. 10)) is an example of a stiff initial value problem, 
the stiffness being related to the fact that the eigenvalues of A arc positive 
and vary considerably in size corresponding to #(A) being large. 

Let us now return to our semi-discrete problem in the formulation (8.8). 
A basic stability mequality for this problem, with for simplicity {=9, is 
obtained as follows: Taking v=un(0 in (8.8a), we get 


(y(t), u(t) +a(up(t). un(Q)=0, tel. 


or with as above {} {|= |] + Ile, 
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5S lus(ty?-+aCuntt), uxtt))=0, 
so that recalling also (8.8b), 
1 
|lun(tyl?+2 fa(un(s), un(s))ds=|lun(O)| [<1] u"|[?, 
9 


and thus in particular, 
(8.13) |Jun(t) ts<[lu,(O)I [ssf u%||, tel. 


This estimate is clearly analogous to the estimate (8.5) for the continuous 
problem. Note that (8.5) may also be proved in the same way as (8.13). 

For the semi-discrete problem (8.8) one can prove the following almost 
optimal error estimate. Recall that we are assuming, for simplicity, that &2 
is a convex polygonal domain and that Vj consists of piecewise linear functions 
on a quasi-uniform triangulation of Q with mesh size h. 


Theorem 8.1 There is a constant C such that it u is the solution of (8.2) and 
Up Satisfies (8.8), then 


te 


(8.14) maxilu() apis 1+ | ioe 5 | | max b?llu(O)hr%op- 
te bes i 


Proof The proof is based on a duality argument involving the following 
auxiliary problem: Given tel let @p: (0. t)— Vj satisfy 


(8.15a) —(@n(s).¥) 4 a(gu(s).v) =O VveVn. se(0, t), 
(8.15b) gn(t) = enlt), 

where e}(s)=un(s)—t,(s) and i,(s)e Vy, satisfies 

(8.16) a(u(s)—up(s),v) =0 YveVp. se(0, T). 


Now, taking v=en(s) in (8.15a), using (8.7), (8.8), (8.16), writing 
6(s)=u(s)~tip(s), and integrating by parts in time we have 


lent? FL (an(s.en())+a(ca(s) xfs) ]ds-+Cn(0, e400) 
=f{len. 9) +2(em, o)lts-+((0), €4(0) 
=J18. on)+a(8, nes+(20). (00) 


1$1 


L 
oe 5(@, Gn)ds+(B(t), n(t))- 


Thus, we have the following simple error representation formula 
t 
(8.17) Ilen(t)||?=— JOG), Pn(s))ds+ (8(t), Gn(t)). 
Now, (8.15) is equivalent to the ordinary differential equation (cf (8.10)) 


—BE(s)+AC(s) =0, se(0, 0), 
cHae. 


Using the explicit solution of this problem corresponding to (8.12) (or (8.20) 
below) and using also Lemma 7.3, we eusily find that there is a constant C 
independent of e}(t) and t, such that (cf Problem 8.2) 


(8.18) Ilpa(s)Ilsien(t)l], Os <1, 

L 

Le t 
(8.19) {leen(s)ilds « C(L+llog 55 Dilen(ol. 
which combined with (8.17) proves that 

llen()//<CO Flog 4) max |]0(s)]]. 

len(Hll eC Flog pa) tne (s)II 
Note that the estimates (8.18) and (8.19) correspond to the estimates (8.5) 
and (8.6) for the continuous problem. To complete the proof we now just note 
that u-u,=u~Up+i,;—un=0~ep, and we then obtain the desired estimate 


(8.14) using the L2 — estimate for 6(s)=u(s)—iy(s) of Theorem 4.3. 0 


Remark Note that the constant C in (8.14) is in particular independent of 
T. 


8.4 Discretization in space and time 
8.4.1 Background 


We shall now consider some methods fur time-discretization of the semi- 
discrete problem (8.8) resulting in fully discrete analogues of (8.7). Let us then 
first consider the related problem (8.11). The qualitative behaviour of the 
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solution y(t) of this stiff initial value problem is (of coursc) similar to that of 
the exact solution u of (8.2) as discussed in Section 8.2. In particular we have 
the following representation for the solution of (8.11) in the case g=0 (cf 


(8.4)): 
M 

(8.20) d= =n? zien, 
ae 


where GoM, are the orthonormal (in R™) eigenvectors of A with corre- 
sponding eigenvalues WS. . Sm Satisfying w1=0(1) and ww=0(h 2). Here 
the large eigenvalues uj correspond to rapidly “oscillating” eigenvectors i 
while smaller eigenvalues correspond to “smoother” eigenvectors (cf Problem 
8.1). By (8.20) we see that n(t) has components that live on time scales in the 
wide range from 0(h7”) to 0(1), that high frequency components of n{t) are 
quickly damped and that y(t) in general will have an initial transient. Note 
that the stiffness of (8.11) is reflected by the fact that the solution n(1) contains 
components with vastly different time scales. 

As indicated, stiff problems like (8.11) put special demands on the methods 
to be used for time discretization. First. for stability reasons onc has, in order 
to avoid excessively small time steps, to use so called implicit methods. ie. 
methods requiring the solution of a system equations al each time step. 
Secondly, one would like to use methods which automatically adapt the size 
of the time steps according to the smoothness of n(t) and thus automatically 
take smaller time steps in a transient and larger steps when r(t) becomes 
smoother. 

We will first briefly consider two classical methods for time-discretization 
of (8.8) or equivalently (8.10) and then the more recent discontinuous 
Galerkin method together with methods for automatic time step control. Let 
O=19<)<. . .<ty=T denote a subdivision of [and write [,=(t,-1. In) and 
let kp=tn—tp—1 be the local time step. 


8.4.2 The backward Euler and Crank-Nicolson methods 
In the classical backward Euler method for the semi-discrete problem (8.8) 


we seek approximations upeV), of u(. . t,). n=0,. ., N, satisfying 
ut—ul-t 
(8.21a) ( Ho | ta(uy.v) =(f(ta). v) VveVp.n=l.2.....N. 
a | 
(8.21b) (ug.v) = (u,v) YveVp. 


Clearly, (8.21a) has been obtained from (8.8a) by replacing the derivative 
Up(t,) by the difference quotient (uj —up~!)/k, with discretization crror O(ky). 
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For a given u}-! we have that (8.21a) corresponds to the following positive 
definite symmetric system of equations for the unknown ul: 


(8.22) (B+knA)s"=BE"!+knF(In), 
where 
n be en 

uns 2 BIG. 
A basic stability estimate for (8.21), with f=0 for simplicity, is obtained by 
taking v=up in (8.2la) to yield 

Tug]? Cuan!) ta(uy, an) ky=0. 
Using here the fact that 


vl | 
Cheng Ys Hugh Sag -12, 


we conclude that tor n=1, 1 N 


1 1 -4I 
sllunil?—- gltah "NP +a(uy upkyso, 
so that by summation 
a 
[up] 2 E aah on) km Sluj ssl ui2, 
m= 


and in particular 


(8.23) [lugl<ilupliss|lwll forn=t,...,N 


which is clearly analogous to (8.13). 
The other classical time-diseretization method for (8.8) is the Crank-Nicol- 
son method: Find uve Vy, n=O... ., N such that 


uf-ut } naan d 
(8.24) ( yy ta( Oe vy) = (dy 
_ 
WvevV,, n=l, LN, 
(8.24b) (ufv)~(uv) YWweVa. 


Here, the difference quotient (u;—uf~!)/k, replaces (U(ty)+0(ty—1))/2 and 
the corresponding discretization error is O(k2). This time we obtain the 
following system of equations on each time level: 


(8.25) [® + se a) g1= | B- a] 0-14 ky (F (ty) + F(ty-1))/2. 
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By taking v=(up+uf')/2 in (8.24a) we easily obtain again the stability 
inequality (8.23) in case f=0. 

For the problem (8.11) in matrix form the backward Euler and 
Crank-Nicolson method read: Find n"eRM, a=1, 2, .. ., N such that for 
n=1,. .,.N 


Alyn-l | 

(8.26) D1 _ + An=g(ta). 

kn 
man la oye 

+l Ay ty Y= +g(tn-1)). 

ie Fi Car tnt = (Btn) 2(tn-1)) 


In the case g=0, (8.26) leads to the following matrix equation for 1": 


(8.27) 


(8.28) (I+ k,pA)yt=n"7). 
With the notation of Chapter 7, we have 


<1 


8.29 t+kpA)7 |= ; 
( ) i( nA eee 1+katt 

since the eigenvalues py, of A are positive. From (8.28) and (8.29), we have 
that 


(8.30) Intle|ne i. ein, n=1!,...,N, 


which is another way of stating (8.23). Similarly we have for the Crank- 
Nicolson method 


(8.31) (45 nA) mP=(I— 5 Ky) poh 


and 


; Lk aul 
k,A)|= max —*—-.< 1, 


[G+ F kA)" 
4 145 katt, 


—i 
a 


which again implies (8.30). 

Not all time-discretization methods for (8.11) (or (8.10)) satisfy stability 
estimates of the form (8.30) (or (8.23)) regardless of the size of the time steps 
ky. As un example, let us consider the so-called forward Euler method for 
(8.11): 


noyn-bo 
(8.32) se —+ Ar! =e(tn-1), 


a 
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or, with g=0, 
(8.33) ms(tak pA) a! 
Here 


I[=k,A|=max |1—kayj| =|! —Kaual<! 
J 


only if knum<2. or, ky<2/uy=O(h?) since uy=O(h-?). This means that for 
the forward Euler method (8.32) we can only guarantee the stability inequality 
In? F=<]1)"| if the time step kp is sufficiently small, or more precisely if 


(8.34) k,sCh2, 


In other words, the forward Euler method (8.32) is conditionally stable undcr 
the condition (8.34). If |I-k,A|=a2>1 with a independent of n, then the 
forward Euler will be unstable and useless for computational purposes and 
thus the method can only be used under the condition (8.34). This condition 
is very restrictive requiring very small time steps k, if h is only moderately 
small. 

One way of phrasing the stability condition (8.34) for the Euler forward 
method for (8.32) is to say that k, has always to be chosen so small that the 
fastest time scale is resolved. Of course this is a natural condition in the initial 
phase of a transient where the “fastest”’ solution components play a role, but 
not so vutside this phase. In contrast to the forward Euler method, the 
backward Euler and Crank-Nicolson methods are both stable regardless of 
the size of the time steps ky, 1e, these methods are unconditionally stable. This 
is a very desirable property of a time-discretization method for a parabolic 
problem. 

In (he backward Euler and Crank-Nicolson method we need to salve a 
system of equations at cach time step (see (8.22), (8.25), (8.28). (8.31)), ie, 
thesc methods are implicit, whereas for the forward Euler method the solution 
n't! is directly given by 1" without solving any system, (see (8.33)), ie, this 
method is explicit, Clearly an implicit method requires more work per time 
step as compared with an explicit method. Thus, on the one hand we have 
unconditionally stable implicit methods and on the other hand conditionally 
stable explicit methods, The more efficient methods for parabolic problems 
belong to the first class; the extra cost involved at each step for an implicit 
method ts more than compensated for by the fact that larger time steps may 
be taken (outside very fast transients, where accuracy requires very small time 
steps). 
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8.4.3 The discontinuous Galerkin method 


We shall now consider the discontinuous Galerkin method for (8.8) which is 
based on using a finite element tormulation to discretize in the time vartable. 
To formulate this method we introduce for a given non-negative integer q the 
space 

Wir= {vi Eo Vii vijePyn). msl. N}, 
where : 

Pa(tn)= {vi In Vai v= Evi with vie Vi}. 

i= 

ie, Wax is the space of functions on T with values in Vy that on each ume 
interval |, vary as polynomials of degree at most q. Notice that the functions 
vin Whk may be discontinuous in time at the discrete time levels t,. To account 
for this we introduce the notation 

v" = lim v(tyts), v2= lim v(tp+s). 

sate 50° 


n 


[v{]=v2-v 


where [v"] is the jump of v at t). 
The discontinuous Galerkin method for (8.8) can now be formulated as 
follows: Find Ué Wp such that 


(8.35) A(U, v)=L{v) Vve Wok. 
where 
N 
A(w. v)= x, J((¥, v)+aow, v))dt 
n=l] 


N =f Q Q 
+ E(w}. v9-94 wh 0), 
2 


n=. 


Ly) = Jf, vydt+(u4, v 9). 
1 


Since vEWh, varies independently on each subinterval I,, we may alter: 
tively formulate (8.35) as follows: For n~1, , N, given UL", find 
USUN,¢Pq(In) such that 


(8.36) f((QU, v)ta(U, wd (UTE v= 
i, 
fat yare(un twin}, WveP(In), 
i, 


where U" =u" 
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ee 


For q=0, using the notation U"=U1=U% |, (8.36) reduces to the 
following problem: For n=1,..., N, find U"eV), such that 


(8.37) (U"-U""! v)tkaa(U", v= JU wide WweVa. n=l, NA 
i, 


wheie U" =u. This is a simple modification of the backward Euler scheme 
(8.21) where the right hand side involves an average of f over I, rather than 
the value of f at tn. 

For q=1 we have that (8.36) is equivalent to the following system with 


with U(t)=Uy + a! 


Ui. tely, UeVn, 


(Uo.v)-+kya(Un,¥)+(UL¥)+ 5 knalUs.¥) 
=(US-lv)+ f(f(s)vids. Ye Vn, 
Th, 


(8.38) 
L 


5 Fn(Llov) + 5 (Wiv)+ ; kaa(U.v) 


f (s—ty-1) (f(s),v)ds, VVEVp. 
i, 


bam 


n 


8.4.4 Error estimates for fully discrete approximations and 
automatic time and space step control 


We now give an error estimate for the discontinuous Galerkin method (8.35) 
in the case q=0, ie, the backward Euler scheme (8.37) with, as above, Vp 
piecewise linear on a quasiuniform triangulation satisfying (4.1). The proof 
is given in Remark 8.1 below. 


Theorem 8.2 Suppose there is a constant y such that the time steps ky satisfy 
kn-1Sykn, n=2,..., N and let U" he the solution of (8.37). Then there is 
a constant C depending only on y and the constant B in (4.1) such that for 
n=l, ..,N, 


(8.41) |lu(t,)—U"ll CU + log?) max f [fU(siids 


ment, 


+max h?/lu(t)/|H22))- 
(Sly 
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Absorbing the “almost bounded” logarithmic quantity in the constant C, and 
using the trivial fact that 


f ilutsylids ss kallili>. 1, 


where I[v|l«, j=sup |}v(s)||, we can write (8.41) alternatively as follows 
seJ 


(8.42) max | u(t)—U(t)lEC (max kpl|tl|>, 1, + max h?|lu(t)[ie(ay)- 
tel n<Nn tel 


Here of course the first term on the right hand side bounds the time 
discretization error and the second term the space discretization error. 

Suppose now 6>0 jis a given tolerance and that we want the time 
discretization error in (8.37) to be bounded by & for all tel (ct Problem 8.7). 
By (8.42) we see that this will be the case if 


(8.43) kalltillx. 1, 2. n=l,...N, 


with C the constant in (8.42). This gives a rule for choosing the local time step 
k, according to the size of ||tl]», :,. Of course, ||tile, 1, is not known in 
advance, but it seems plausible that one would have 


(8.44) Kal| ull, 1,~ U"-UN 1, 


and thus we arc led to the following criterion for correct choice of time steps 
for (8.37) involving only the computed solution U: 


(8.45) jure tad. 


One may satisfy this criterion at each time step ¢ g by trial and error as follows: 
With U"~! given, a first guess of the next time step ky is made (eg kn=kn-1) 
and a corresponding U" is computed. If ||[U"-U"~]| is sufficiently close to 
S/C, then U" is accepted and otherwise ky is decreased or increased to make 
|[U"-U9"{|=8/C. Variants of this procedure are possible. For cxamplc, as 
initial guess for k, one may choose &k,q—/(C ||U"!-U" II). 

Under reasonable assumptions one can show that (8.44) is correct (except 
possibly for a few initial steps) and thus one can prove that if the computation- 
ally verifiable criterion (8.45) is satisfied, then the time discretization error 
in (8.37) will be bounded by 5. A typical behaviour of ||u(e){] for t moderatcly 
small, is given by ||W(t)||~C1-*, where 0<a< I (cf Section 8.2). In such a case 
the method based on (8.45) will thus automatically choose the correct time 
step sequence ky=Ou/C (cf Example 8.1 below). 
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Returning to the space discretization error in (8.42) let us notice that in 
(8.37) it is possible to use different spaces Vf for the space discretization on 
different time intervals I,. The error estimate (8.42) also holds in this case, 


if for example Vic Vi! with now the space discretization error bounded by 


(8.46) © max h2 max Jlu(s)i|H'(a), 
nsN sel, 


where h, is the mesh size in V#. The condition Vic Vi! may naturally be 
satisfied in the usual situation when u becomes smoother as t increases. 

For simplicity assume now that f=0 in (8.2). By our assumption that Q is 
convex, we then have by (8. 2a) and (4.27) that |lu(t)|livqaysCi|u(t)]]. To also 
control the space discretization error to the tolerance 5. we are therefore led 
to choose h, depending on n so that 


, a) 
(8.47) hi |{ttl| 22, 1,2° rom 
Again we may estimate the unknown quantity |[ul]., 1, using (8.44). 
For the method (8.38) one may. under the assumptions of Theorem 8.2, 
prove an error estimate of the form 


(8.48) max |{u(t)—U(t)||<C max (k2 |[u2)|]0.1,4+h2 max ||u(s)i{wca)), 
te n sel, 


2, 
where u2=9"" and where € contains a logarithmic factor as above. We also 
dt? e 


have the estimate 


(8.49) max |Ju(ta)-U TI <C max (K3] Auf} 21, 
: n 


+h, max ||u(s)|hr(s2))- 
sel, 
which shows that (8.38) in fact is third order accurate in time at the discrete 
time levels ty. 
Relying on (8.49), we are led to control the time discretization error in 
(8.38) as follows: 


(8.50) KY Aue =2. 
Po€. 

Again the unknown quantity || Au)||._ 1, may be estimated using the com- 

puted solution U. The method (8.38) with time step control (8.50) will 

typically require much fewer time steps than (8.37) with time step control 

(8.45). Although (8.38) is more costly per step than (4.37), in general (8.38) 

is much more efficient than (8.37) (cf Example 8.1), 
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In this section we have briefly indicated some important aspects of the 
problem of time discretization of parabolic problems. Of particular interest 
here is the discontinuous Galerkin method for which almost optimal error 
estimates can be obtained. These may be used as a basis for the design of 
rational methods for automatic time step control. With classical methods and 
techniques this was not achieved. For more information on this topic, see [J3], 
{EJ1), [INT], [JEL]. 


Remark 8.] The proof of the error estimate (8.41) for the backward Euler 
method (8.37), ic, the discontinuous Galerkin method (8.36) with q=0, is 
analogous to the proof of Theorem 8.1 and is again based on a duality 
argument. Firsl, we introduce the interpolant Ue Wp, defined by 


(8.51) J a(U-u, v)ds=0 VveWnk, a=l,. ..N, 

1, 
ie, 

W=sUl= +f an(syds, 

kni, 

where dy(s)e Vp is given by (8.16). Now, let Zc Wax satisfy 
(8.52) A(w, Z)=(wN, UR-UN) YweWhk, 
ie, Zis a backward Euler approximation of the solution z of the problem: 

—2-Az=0 in 2X1, 

2(T) = US-UR, 


Now, taking w=U~Ue Whe in (8.52) and using the fact that 
A(u, Yy=L(y) Wve Wn, 

we get, writing 9=u—U and recalling (8.51), 
{{U8-U|/2=A(U-U, Z)=A(u-U, Z) 


N N 
= ZS ((8, Z)+a(u-U, Z))ds+ EO, Zty 


n=l, 
is) Nol ul a 
+84, ZQ)=— = (82, [27 +(0%, 2%), 
o=1 
This gives the error representation formula 


(8.53) LUS-ONP-@Y,2%)~"E 4. [z")). 
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Now, corresponding to (8.18) and (8.19), we have the following stability 
estimates tor (8.52) (ct Problem 8.6): 


(8.54) |Z l<|JUN-ON]], n=1,....N, 
N-1 Te a 
(8.55) = [I[Z*]|| = CU+tlog —) \JU8-ON)). 
n=l kn 
Together with (8.53) these estimates prove that 
T 12 a 
{Ju(ty)-UN]| <C (14+log —) — max|lu(t)-U(0ll, 
kn tel 


which proves (8.41) for n=N. Since N may be replaced by n for n=1,..., 
N, we thus obtain (8.41) by estimating ||u(t) —U(t)|| as in the proof of Theorem 
8&1. 0 


Remark 8.2 The stability estimates (8.54) and (8.55) for the discrete auxiliary 
problem (8.52) used in the above proof, correspond to the estimates (8.5) and 
(8.6) for the continuous problem (8.7). In particular, the near optimality of 
the error estimate (8.41) is a result of the use of the strong stability estimate 
(8.55). Conventional (non-optimal) error estimates for (8.37) only use the 
weaker stability estimate (8.54). 


Remark 8.3 Note that the constant C in (8.41) in particular is independent 
of ty. This means that we may compute over very long time intervals essentially 
without growth of the global error. This reflects the parabolic nature of our 
problem. UU 


Remark 8.4 If we apply the backward Euler (i=1), the Crank-Nicolson 
method (i=2) and the discontinuous Galerkin method with q=1 (i=3) to the 
scalar problem 


Ht+An=0, t>0, 


(8.56) 1(0)=1P, 
where A>, we get the following lime stepping methods 
(8.57) B= r(k pS), i=1,2-3, 
where for x>0, 
in% (=% 
n= nbd 2, eo=—F. 
145 Tips kh 
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The relations (8.57) should be compared with the following relation satisfied 
by the exact solution of (8.56): 


A(ta=eK™ H(ta-1), 


Here of course the r(x) are rational approximations of the exponential e7*; 
we have that 


e-*=n(x)=0(x'+!) as x 0, 


corresponding to the fact that the order of the method i is i, i=1, 2, 3. We 
also have 
jrQx)]s1 for x20, i=1, 2. 3, 
and 
n(x) +0 as x— % for i=I and 3, 
but 
(8.58) r(x) > — 1 asx %, 


By (8.58) we have that the rational function r(x) associated with the 
Crank-Nicolson method docs not behave like the exponential e~* for x large. 
This means that the Crank-Nicolson method is not suitable for use in time 
intervals where the exact solution iy non-smooth, fur example in initial 
transients, where components corresponding to large eigenvalues 4 play a 
role. 


Example 8.1 In Fig 8.1 we give results obtained by applying (8.38) with 
variable spaces Vp to the problem (8.3) with [=0 and up(x)=1, O<x<z in 


which case [}u(t)|| ~Ct-# as t > 0 with ant, cf Section 8.2. The space and 


time step control was monitored by computational forms of (8.47) and (8.50). 
The number of space steps was restricted te be of the form 2™, m=0,1.... - 
We sce that the error |!e(t)I|L,w. «9 is, up to a factor 2, constant in tine and 
of the order 0.28=0,001. Also notice that the time and space steps vary 
considerably in time and that the total number of steps N ~ 30. This example 
is taken from [EJL]. 
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Error Problems 


8.1 Consider the one-dimensional parabolic problem (8.3) with f=0. 
0, DUE Ss Tolerance 


(a) Prove using (8.4) that if u(x)=x-x, 0<x<z, then 


* 
[laqpilece 4 ast 0. 


eons 


(b) Discretize in space using piecewise linear functions on a uniform 
partition with space step h and determine the corresponding 
ordinary differential equation (8.8). Determine the constant C in 
the stability requirement (8.34) for the forward Euler method with 
uniform step size k. Hint: The eigenvectors of the matrices B and 


0. 008 


i 0.002 


oud A in this case are given by W=(vj, . . ., viy) with 
Fs s WW ny 
vicsin¢ : M= —-l. 
bee ' ( M+1 h 
9.0 


Also determine the condition numbers of the matrices B and A. 


8.2 Let n be the solution of (8.11) with g=0 given by (8.20). 


I (a) Prove that 

| 

l 0 

| Spacesteps, timesteps, time ('9log-seale) 4A 2 C|n®| 10. 

| in(!+|An(yl = > 

(b) Using (a) and the fact that |AE|<Ch~2|§] prove that 
ale “3 

ae ; HAF Mog n J (nG)/+1An@)pds<c C+! log Bl) In 
0 


8.3 Suppose the time steps ky for (8.37) are chosen according to (8.43) 
with a non-smooth solution u satisfying ||u(t)}}~Ct7!** for some e>0, 


-2 


; 8.4 Propose an efficient method for solving the problem (8.22) related to 
sist aime the backward Euler method (8.21). 


| | Compute the number of steps required if T=1 and compare with the 
i se number of steps required in the case of a smooth solution satisfying 
—— spacesteps |Ja(t)||~C. 
| =B ——  timesteps 
ii 
| 
| 


~8 
8.5 Compare computationally the methods (8.37) and (8.38) for the 


problem considered in Problem 8.1 with varying degree of smoothness 
of the initial data ug. Use (8.45) for the time step control for (8.37) 
and (8.50) for the method (8.38). Compare with results obtained using 
a constant time step. 


Fig 8.] Fully discrete approximation of parabolic problem with automatic time and 
Space step control 
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8.6 


8.7 


8.8 
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Prove for the method (8.37) with f=0 and under the assumption 
knSCkp-), that 


(a) [[U||<]Ju%[, n=1,...,N, 


N nopjn- 
(b) (Eto |) POY) cli, 


N Rpn-l ; v2 
() Ep PO ec (ittog Ly uy, 
acl k k; 


n 


Also extend (a) to (8.35). 


Hint: For (b) choose v=t,(U"-U"~!) in (8.37). For (c) use (b) and 
Cauchy's inequality. 

Using the error representation formula (8.17) prove the following 
variant of the estimate (8.14) for the semi-discrete problem (8.8) with 
f=0: 

£ 
t 


Ss 


||u(t)—un(t)| 


v2 
J\l9(s)lids + CU + Hog al ymax {[6(s)|I 


7st 
2 
<C(1+| log Al) Elna. 


This estimate shows that the error has optimal order for t bounded 
away from zcro cven for non-smooth initial data. In other words, to 
have a small error for t away from zero it is (for linear problems) not 
necessary to resolve an initial transient completely. Similar results hold 
for (8.30) and (8.38) (cf [Th]). 


Consider the following time-dependent variant of the convection- 
diffusion problem (2.23): 


du ou au 
at HAutBi a thea 


=fin xl, 


u=0 on [x], 
u(x,0)=u", xeQ. 


Extend the methods (8.8), (8.21), (8.24), (8.35) to this problem and 
prove in the case f=0 a stability inequality analogous to (8.23). 


9. Hyperbolic problems 


9.1 Introduction 


In previous chapters we have scen that the finite element method applied to 
linear elliptic and parabolic problems produces numerical methods with very 
satisfactory properties. We now turn to problems with mainly hyperbolic 
character, such as eg convection-diffusion problems with small or vanishing 
diffusion. Problems of this form typically occur in fluid mechanics, gas 
dynamics or wave propagation 

It was observed early on that, in contrast to what is the case for eHiptic and 
parabolic problems, standard applications of the finite element method to 
hyperbolic problems lead to numerical schemes which frequently do not give 
reasonable results. More precisely, it was observed that standard finite 
element methods for hyperbolic problems do not work well in cases where 
the exact solution is not smooth. If the exact solution has eg a jump 
discontinuity, then the finite element solution will in general exhibit large 
spurious oscillations even far from the jump and will then not be close to the 
exact solution anywhere. This is of particular concern since in many interesting 
hyperbolic equations, the exact sulution is not smooth. Only recently has it 
been possible to overcome these difficulties and construct modified non- 
standard finite element methods for hyperbolic problems with satisfactory 
convergence properties. In this chapter we will present these new finite 
element methods, the streamline diffusion (cf Remark 9.9) and discontinuous 
Galerkin methods, and compare them with standard methods. These new 
methods apply to first ordcr hyperbolic problems such as eg convection- 
diffusion problems with small diffusion. We will also briefly discuss standard 
finite clement methods for a second order hyperbolic problem, the wave 
equation for the Laplace operator. In this case improved finite element 
methods are still to be discovered. 
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9.2 A convection-diffusion problem 


As an example of a linear hyperbolic-type equation let us consider the 
following convection-diffusion equation: 


9.1) 32 sdiv(uB)+ou-eAu=0 in Q&1, 


Here u is a scalar unknown representing a concentration for example, 
B=(B1, . . -, Ba) is a given velocity field, o an absorption coefficient. 20 a 
diffusion cocfficient, QCR* and I=(0, T) is a given time interval, The 
equation (9.1) is of mixed hyperbolic-parabolic type with more or less 
hyperbolic or parabolic characicr depending on the size of e and B. We assume 
here that ¢ is small, which means that (9.1) has mainly hyperbolic nature (if 
€ is not small then the material in Chapter 8 applies, cf Problem 8.8). In 
particular, with e=0 we have the following purely hyperbolic equation: 


du 


(9.2) at t+div(uB)tou=0 in QxI, 


or equivalently 
(9.3) a +B. Vutyut0 in OXI, 


where y=o+div 3. Let us briefly study this purely hyperbolic equation and 
first consider a stationary situation with u and B independent of time t, ie, 
let us consider the following equation 


(9.4) B- Vu+yu=0 in Q, 


where B=A(x) and y=y(x) are given coefficients. The streamlines correspond- 
ing to the given velocity field B=(fi. . . .. Ba) are given by the curves x(s), 
x=(X},..., xq), where x(s) is a solution of the following system of ordinary 
differential equations: 

= Bin) izle. ad. 

These curves, parametrized by the parameter s, are also called the characte- 
ristic curves (or characteristics) of the problem (9.4). Assuming that p is 
Lipschitz-continupus (ie [B(x)—B(y)|SC|x—y| Vx, ye Q, for some constant C), 
there is for a given point x¢ Q exactly one characteristic x(s) passing through 
A, ie, there exists a unique function x(s) such that (cf Fig 9.1) 


(9.5) SE = B(x), inl. d, 
x(0)=x 
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Fig 9.f 


We now observe that if x(s) is a characteristic, then by the chain-rule we have 


d d i 2 au 
as =r = F—fp- Vu. 
ds (ubx(s)) = 8x, ds i=1 3x; BSpeyu 


so that by (9.4) 
(9.6) < u(x(s)) +yu(x(s))=0. 


Thus, along each characteristic the partial differenual equation (9.4) is 
reduced to an ordinary differential equation. If the concentration u is known 
at one point on a given chaacteristic x(s}, then u can be determined at other 
points on x(s) by integration of (9.6). As an example let us assume that u is 
given on the inflow boundary T- defined by 


T_={xel n(x)- B(x) <0}, 


where [is the boundary of 92 and n(x) is the outward unit normal to I at xe. 
(cf Fig 9.1). The concentration u at an arbitrary point x in Q can then be 
determined by integration along the characteristic passing through X starting 
on [_. In particular this means that in the problem (9.4), effects are 
propagated precisely along the characteristics. 

It is important to nutice thal a solution u of (9.4) may be discontinuous 
across a characteristic. For instance, if the given concentration v on [_ has 
a jump discontinuity at some point Xe, then the solution u will be 
discontinuous across the entire characteristic passing through x. As a simple 
example let us consider the following problem in R?: 
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Su 


3x; =U in {xeR2:0<x,<1}, 


u(0, x2)=1 for 0<a0<3, u(0, x2)=0 for $<m<l, 


corresponding to taking Bi=1, B.=0 and y=0 in (9.4). Clearly the solution 
to this problem is given by (cf Fig 9.2). 


u(X, X2)=1 for O<xr<l, O<x1<1, 


u(x), X2)=0 for $url, O<xi<1, 


Fig 9.2 


Let us now return Lo the time-dependent problem (9.3). If we here replace 
t by xo and Iet Bg=1, then this equation can be written: 


ul su 
9.7 = 6) —+yu= 
t ) i=0 Bi Ox, eo 


This is formally an equation of the same type as (9.4) and the discussion 
concerning (9.4) also applies to the equation (9.7). In particular, the 
characteristics of (9.7) are the curves (x(t), 1) in space-time, where x(t) satsifies 


dx; F 
cane i(X ,t) i=l, .,d, 


(here the parameter s is chosen to be equal to t=xp corresponding to the 


~ Xo, 
equat ——=pPo=1). 
quation == Bo=1) 


Remark 9.1 Another equation of the same form as (9.2), although nonlinear 
since here the velocity is unknown, is the continuity equation (or principle 
of conservation of mass) in gas dynamics: 
Cligere 
—+div =0, 
3t (gv) 
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where 9 is a density and v a velucily. This equation. and additional equations 
expressing conservation of momentum and energy and a constitutive relation. 
constitutes a system of nonlinear hyperbolic equations which are the basic 
equations of gas dynamics. O 

We will also consider briefly the following generalization of the scalar linear 
hyperbolic equation (9.3): 


au, o du 
9.8 “+ A, ——+Bu= 
(9-8) or a : 8xj Bush 


where the A, and B are mXm matrices depending on x and t, the Aj being 
symmetric, and u is an m-vector. We say that (9.8) is a linear symmetric 
hyperbolic system. In particular we shall consider a (positive) Friedrichs’ 
system which is an cquation of the form (9.8), together with certain boundary 
and initial conditions, satisfying a positivity condition. A simple example in 
one space dimension of a system of the form (9.8) is given by 


ur 
uz 


guyf 9-1] aug ye 
St -1 0] 38x 


which is another way of writing the wave equation 


Bw _ B?w_ 
at ax? 
using the notation ws, u2= oe 


In the case of one space dimension (ie with d=1), asystem of the form (9.8) 
together with appropriate boundary and initial conditions can be solved using 
the method of characteristics. In this case there are m characteristics (x'(t), 


t)i=1,..., m, through each point x. satisfying the equations 
as nod) i=... m, 
dt 
where the Aj(x, 1), i=1, . ., m, are the eigenvalues of the matrix Aj(x, 0. 


9.3 General remarks on numerical methods for 
hyperbolic equations 


Common methods for the numerical solution of hyperbolic equations are of 
the following types: 


HI 


~ method of characteristics, 
~ finite difference methods, 
— finite element methods. 


The method of characteristics may be used for scalar linear equations in 
several space dimensions and for linear systems in one space dimension. In 
practice this would come down to solving first the equations for the 
characteristics and then integrating along the characteristics, in both cases 
using some numerical method for integrating ordinary differential equations. 
In principle this is a very good method but it may not be so easy to use in 
practice, particularly not for a system. Further, for a mixed hyperbolic - 
parabolic equation like (9.1), this method cannot easily be used. For these 
more gencral problems one usually uses finite difference or finite element 
methods hased on a fixed mesh, ie, a finite difference or finite element mesh 
that is not adapted to fit the characteristics of the particular problem to he 
solved. The use of a fixed mesh gives mcthods which are easy to program but 
it also may cause numerical difficulties if the exact solution is non-smooth with 
©g a jump discontinuity across a characteristic. In such a case conventional 
finite difference or finite element methods will produce approximate solutions 
which cither oscillate (as standard Galerkin or centered finite difference 
methods) or excessively smeur out a sharp front (as do classical artificial 
diffusion methods), see Fig 9.3. 
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Thus, conventional methods arc lacking in either stability or accuracy. Below 
we shall present the recently introduced streamline diffusion method and the 
discontinuous Galerkin method which at the same time have high order 
accuracy and good stability properties and which perform considerably better 
than the conventional methods (cf Fig 9.3 and Examples 9.2, 9.5 and 9.6 
below). 


9.4 Outline and preliminaries 


We will first consider scalar problems of the form 


2 +div(Bu)tou-eAu=f (x,t)eQXl, 


(9.9) 
u(x, 0)=ug(x) xeQ, 


with the stationary analoguc 
(9.10) div(Bu)+ou—eAu=f in Q, 


together with boundary conditions, where Q is a bounded domain in R94, 
1=(0,T) is a time interval, and the coefficients o, e2U and B=(Bi1, . .. Ba) 
depend smoothly un (x,t) or x. We assume that 


(9.11) div B+oza in QxI, 
where a2() is a constant with a@>0 in the stationary case. This condition will 
ensure the stability of the problems (9.9) and (9.10) for all e20 (for € small 
(9.11) can be relaxed, see eg [Na]). The boundary conditions may be of 
Dirichlet, Neumann or Robin (third) type. For simplicity we will consider two 
model problems with constant coefficients and Dirichlet boundary conditions, 
one stationary and one time-dependent. We leave the straight forward 
extension to variable coefficients and other boundary conditions to the 
problem section. 
We shall consider the following finite element methods: 


Standard Galerkin 

Classical artificial diffusion 

Streamline diffusion 

Discontinuous Galerkin 

Time discontinuous streamline diffusion. 


moOm> 
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The methods A, B and C apply to stationary mixed elliptic hyperbolic 
equations of the form (9.10) with e small while method D is designed for purely 
hyperbolic problems of the form (9.9) and (9.10) with e=0. Finally. E is 
intended for the time-dependent problem (9.9) with e>0. 

To conclude the chapter we shall discuss the application of the above 
methods ty the case of Friedrichs’ system and also consider some methods for 
second order hyperbolic equations, such as the wave equation. 

We now state the two model problems to be discussed below. Let then Q 
be a bounded convex polygonal domain in R? with boundary T and let 
B=(61,62) be a constant vector with [B/=1. We shall consider the following 
stationary boundary value problem: 


-eAutugtu=f in Q, 


(2) u=gonP, 


where & is a positive constant, and vg—fB- Vv denotes the derivative in the 
B-direction. The corresponding reduced problem obtained by setting e=U 
reads: 


uptu=f in &, 


(9.15) u=gonT_. 


where T_ is the inflow boundary defined by 
T_={xeP: n(x)-p <0}, 


where n(x) is the outward unit normal to [’ at the point xel’ ‘The 
characteristics of the reduced problem (9.13) are straight lines parallel to B 
(cf Fig 9.4). We notice that in the reduced problem the boundary values are 
prescribed only on the inflow part [_. 


Fig 9.4 
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Let us briefly recall some basic facts concerning the regularity of the exact 
solutions u of (9.12) and (9.13), As already noted, the solution of the reduced 
problem (9.13) may be discontinuous with a jump across a characteristic if 
the boundary data g is discontinuous, for example. In the full problem (9.12) 
with e>0, the solution is continuous in @, and such a jump will be “spread 
out” on a region of width 0( Vc) around the characteristic. Such a narrow 
region (for e small) where u (or some derivative of u) rapidly changes, is called 
a layer. If the values attained by the solution u of the reduced problem on 
the outflow boundary P,=I\I_ do not coincide with the boundary value g 
specified in the full problem, then the solution of the latter problem will have 
a boundary layer at C4. The thickness of this layer will be O(e), ef Fig 9.4. 

Let J=(0. 1) be a space interval, 1=(0, T) atime interval, and Q—Jx1. Then 
the time-dependent model problem is as follows: 


Ut Uy — EUxx=f in Q, 
(9.14) u(x, 0) =up(x) xeJ, 
u(x,t=g2(x,0) x=0.1, tel, 
with the corresponding reduced problem: 
uytuy=f in Q, 
(9.15) u(x,0)=uo(x) xeJ, 
u(0,t)=2(t) tel. 


Clearly the problem (9.15) has (except for the u-term) the same form as (9.13). 
‘Uhe characteristics of (9.15) are straight lines in the (x, t)-plane with direction 
(1, 1) and the inflow boundary is given by the points (x, t) with x=0 or t=O 
(cf Fig 9.5). 


Fig 9.5 


We use the following notation when discussing the methods AD tor the 
stationary problems (9.12) and (9.13): 


(v, w)=Jvw dx, (Vv. Vwy=fVv- Vw dx, 
Q a 
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Well= fica, Ivlls={lvlhieray, 
<v, w>=Jvw n-6 ds, 
i 


<v,wou=fvwn-Bds 9 <v, w>,=Jfvwn-B ds, 
r te 


Iwl=(fyein = Bids), 
Tr 


where [4=P\P_-={xeIin(x)- B20}. We notice that by Green’s formula 
(5, W=<¥, W-(v. wp). 


Further let {T,} be a family of, for simplicity, quasi-uniform triangulations 
Tn={K} of Q with mesh size h which satisfy as usual the minimum angle 
condition. For a given positive integer r we introduce the finite element space 


Vn={veH)(2): viKeP(K) VKETp}. 


ie, Vp is the space of continuous piecewise polynomial functions of degree 
r. From the approximation theory of Chapter 4. we have that for any 
uéH'*!(£2) there exists an interpolant dhe V) such that 


(9.16a) — |lu-ahif<Chi* lulls. 

(9.16b) {ju dP|])=Ch julie a. 

Moreover, if the derivatives of u of order r+1 are bounded on Q, then 
Ju-dhl<cCht!, 

and with somewhat less stringent regularity requirements (sce [Cil) 

(9.16c) ju-O"|sCh 7 {jul], ay. 

In the proofs below we will often use the inequality 

2absea?+e7'b?, 


for a, b real numbers and e>0. 


9.5 Standard Galerkin 


Let us first consider the standard Galerkin method for the stationary model 
problem (9.12) with e>0 and let us then for simplicity assume that the 
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boundary data g is zero. This problem can be given the following variational 
formulation: Find uel {}(Q) such that 


(9.17) (Vu, Vv)+(ugtu,v)=(f.v) Wwe HQ). 
Let now the finite element space 
VrelveVa: v=0 on F} 


be given. The standard Galerkin method for (9.17) reads: Find u®e Va such 
that 


(9.18) e( Tut, Vv) +(uh+ub, vy=(£v) Wve Vy. 


This method will perform well if c#h, but if e<<h then this method may 
produce an oscillating solution which is not close to the exact sulution. To get 
an idea of what may happen, let us consider the following simple one- 
dimensional example: 


Example 9.1 Consider the boundary value problem 


—eu,,+u,=0, J<x<l, 


(9.19) u(O)=1, u(1)=0, 


with U<e<<l. The solution of this problem is given by 


wick ak 
u(x)=a(l-e  * ), a=(I-e Fp! 
For ¢ small u(x) is close to | except in a layer at x=1 of width Q(e) where u 
decays from 1 to Q, see Fig 9.6. 

If we apply the standard Galerkin method with piecewise linear functions 
on a uniform mesh with mesh length h to (9.19), we obtain the following 
system of equations for the values U; of the finite clement approximation up 
at the gridpoints xj=ih, i=0, 1, ..., N, where xn=1: 


— Fy[Uin 204 Ui-d+ Fa [Urs Ui-r}=0, iy: NEY 
(9.20) 
Up=1, Un=0. 


We notice that (9.20) may also be viewed as a difference scheme with acentral 
difference approximation (U;,.;—Uj-,)/2h for the convective term ux. Now, 
if N is odd and ¢ is very small, then the solution U; of (9.20) is approximately 
equal to 1 fori even and equal to 0 fori odd, and we get a sulution that oscillates 
in the whole region and that is not close to the exact solution (cf Fig9.6). O 
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Fig 9.6 Exact solution and approximate solution by the standard Galerkin method 
(solid lines) for (9.19) with ¢=0.01, h=/1I, (cf also Problem 9.5} 


To sum up, the standard Galerkin method (9.18) may produce an oscillating 
solution if ech and the exact solution is non-smooth. However, if the exact 
solution happens to be smooth, then the standard Galerkin method will 


produce good results even if e<h (cf Problem 9.2). 
Let us now turn to the reduced problem (9.13) with e=0. We will consider 
two variants of the standard Galerkin method. 


Standard Galerkin with strongly imposed boundary conditions: 
Find ueV, with u"=g at the nodes on T- such that 

(9.21) (upt+ub, v)-(f, v) Wve Vp with v=0 on T 
Standard Galerkin with weakly imposed boundary conditions: 
Find u®eV), such that 

(9.22) (utub, v)—<uh, v>-=(Ly)-<gv>- Wen. 


Let us analyze the method (9.22) (we leave the method (9.21) to Problem 
9.3). 
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Introducing the notation 
b(w, v)=(watw, v)-<w, v>_. 
lv)=(fv)-<g.v>-, 
this methods reads: Find uyeV;, such that 
(9.23) b(u", v)=/y) WEVh. 
Since the exact solution u satisfies (9.13), we clearly also have 
btu, v)=/(v) VveVn, 
and thus by subtraction we get the following equation for the error e=u—up: 
(9.24) be, v)=0 Wevn. 
The stability of the method (9.22) is a consequence of the following property 
of the bilinear form b; 
Lemma 9.] For any veH'!(Q) we have 
(vw) =|IMP+5 IP. 
Proof By Green's formula 
(va.vJ=—(yvg)t<vv>, 


so that 
(vp.¥) - devo: 
2 
Ilence 
b(v,v)=||v||2+ ews + fevy>- —<VVv>_= 


=||lyiz4 1 =Iyit4dyl2 
=||v}| +5 Sve 73 <vre—= lvl +5lvl i 


since n-B20o0nT, andn-B<0onP_. O 


Since (9.22) is equivalent to a linear system of equations with as many 
unknowns as equations, we obtain uniqueness and hence also existence of a 
solution from Lemma 9.1. Let us now prove an error estimate for the standard 
Galerkin method (9.22). 
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Theorem 9.1 There is 4 constant C such that if u satisfics (9.13) and uve Vy, 
is the solution of (9.22), then 


(9.25) |Ju—uh|j+|u—u] s Ch']lulieer. 


Proof Let GeV; he the interpolant of u satisfying (9.16) and write y=u—a" 
and e>=uh— ah so that eb=n>--e, where e=u—u". By Lemma 9.1 and (9.24) 
with v=ebeV),, we have 


eM [2+ JierP=b(er, e)=b(n, e”)—b(e, eF), 
=b(nh, eb)=(nf, eb) + (nh, elt)—<nb, elt 

1 il 
={Inpll?+|Inll2+ plletlP+ P+ glen. 


Recalling (9.16) we have that 

[Impl [nPil+ yhl<Ch'ljull, a1. 
and thus 

{febl| + leh <Ch fuller. 


Since e=e"—n,, the desired inequality now follows from the triangle 
inequality and the proof is complete. OD 


The estimate (9.25) proves that if the exact solution u of the reduced 
problem (9.13) happens to be smooth so that ||ul||r1 is finite, then the standard 
Galerkin method (9.22) will converge at the rate O(h"). Although this rate is 
one power of h from being optimal, it shows that the standard Galerkin 
method will perform rather satisfactorily in this case. However, in general u 
will not be smooth and in this case the standard Galerkin method gives poor 
results (the error estimate (9.25) is uscless, for example, if u is discontinuous 
since then |lu||;=). 


Problems 
9.1 For r=l1, 2, estimate the norm |{ul|, of the solution u of the 
one-dimensional problem (9.19) in terms of e. 


9.2 Prove an error estimate for the standard Galerkin method (9.18). 
Explain why this estimate does not necessarily imply that this method 
performs well for € small. 


9.3 Prove an error estimate for the method (9.21). 
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9.4 Consider for e>0 the problem 


—eAutdiv(pu)+ou=f in Q, 
u=0 inT, 


with variable coefficients B(x) and o(x) satisfying the condition (9.11) 
with a>0. Formulate the standard Galerkin method for this problem 
and prove a stability estimate. 


9.6 Classical artificial diffusion 


The simplest way to handle the difficulties connected with the standard 
Galerkin method (9.18) with e<h and (9.21)-(¥.22) is to avoid these situations 
completely. This can be done either by decreasing h until e>h, which may 
impractical if € is very small, or simply by solving, instead of the original 
problem with diffusion term ~¢ Au. a modified problem with diffusion term 
—hAu obtained by adding the term —6Au, where 5=h-e. This is the idea 
of the classical artificial diffusion (or viscosity) method. To be precise. this 
method for solving (9.12) with e<h reads: Find ute Vp such that 


(9.26) h(Vul, Vv) +(ud+urv)=(fv) Wve Va. 


This method produces non-oscillating solutions but has the drawback of 
introducing a considerable amount of extra diffusion. In particular, this 
method introduces a diffusion term —huy, acting in the direction  perpen- 
dicular to the streamlines (‘‘crosswind” diffusion), and a sharp front or jump 
across a streamline will be considerably smeared out. Moreover, due to the 
added term —§ Au such a method is at most first order accurate, and the error 
is at best O(h) even for smooth solutions. 


9.7 The streamline diffusion method 


It turns out that to considerably reduce the oscillations in the standard 
Galerkin method (9.18) in the case e<h, it is sufficient to add a term —Bupe 
where 6=h—e, ie, a diffusion term acting only in the direction of the 
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streamlines. Such a modified artificial diffusion method would read: Find 
ute Vy such that 


(9.27) e(Yub, Vv) Fd(ub, vp) Ku tun, v=(L¥) Wwe Vn, 


where 6=h—e. This method introduces less crosswind diffusion than the 
classical artificial diffusion method (9.26), but still corresponds to an 
O(h)-perturbation of the solution of the original problem. 

However, it is possible to introduce the magic term d(up, Vp) appearing in 
(9.27) without such a perturbation. Let us first see how this may be done in 
the case e=0. 


9.7.1 The streamline diffusion method with ¢=0 


Let us start from the standard Galerkin method (9.22) with weakly imposed 
boundary conditions. If, in the terms (. , .), we replace the test function ve Vp 
by v+hvg, we pet the streamline diffusion method: Find w'eV;, such that 


(9.28) (ug+ut, v+hyg)—(+h) <uP, v>— 
=(fvthvp)-(+h) <gv>-  WveVh, 


where for convenience we have also multiplied the boundary terms by the 
factor (1+h). We notice the presence of the term h(up, vg). Further, we notice 
that the relation (9.28) is valid if we replace u> by the solution u of (9.13), 
ie, the method (9.28) is consistent with (9.13) and does not introduce an 
O(h)-perturbation as do (9.26) and (9.27). 

Let us now analyze the method (9.28) and introduce the following notation 


B(w,v)=(wa+wvthve)—(1+h) <www>_, 
L(v)=(f.v+hvg)—-(1+h) <g,v>-. 


The method (9.28) can then be formulated as follows: Find u%eV), such that 
Bu"y)=L(v) — WveVp. 

Moreover, as we have just noted, the exact solution of (9.13) satisfies 
B(u,v)=L(v) YveH(Q), 

and by subtraction we thus have the following error equation: 

(9.29) B(e,v)=0 Vvevn, 


where e=u-ul, 
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We will prove an error estimate in the following norm 
Illlp=Chllval?+ liv |e, 


This choice of norm is related to the following stability property of the bilinear 
form B(. , .). 
Lemma 9.2 For any vell'(Q) we have 
Biv,v)=|[vllp.- 
Proof By Green’s formula we have 
(ypv)=3 <vyv>, 
and thus 


Bivy)j=tP <vyv>—(1+h) <v,v>— F Hlv]/2 + bllv_ll? 


ath (<vv>4— <v,v> )4+/||vil2+h]|vall? 


_ith 
2 


which proves the desired equality. UL 


Iv/?+||vl|?-+h]|vall?. 


We can now prove an error estimate for the streamline diffusion method 
(9.28). 


Theorem 9.2 There is a constant C such that if u” satisties (9.28) and u satisfies 
(9.13), then 


(9.30) JJu—ubj[g< Chiu] lean. 
Proof Let G4eV;, be an interpolant of u satisfying (9.16). Writing as before 


yh=u—a> and eM=ub—Gh, and using Lemma 9.2 with v=e and (9.29) with 
v=uh~ii, we get 


llellf=Ble, e)}=Ble, n")—B¢e, eM) =Be. n!) 

=(ep. nP)+h(eg, nb) +(e, nP)+h(e, nf)—(L 1h) <e, nh 
< Bleg|rh-MnP|?+ Blfeg| +n] pl 2+ tihell2+ inh? 
+Hete+n2iigle+ CFM ep ec thy PP. 
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Recalling the approximation result (9.16), we thus have 


Weil < ch? Njuil2,.,, 


which proves the desired estimate. 


Remark 9.2 Notice that it is the presence of the term hlleg||? in the quantity 
Hell dominated by B(e, e) that makes it possible to split the critical term 
(eg, nh) into one term h|lep|i2/4 that can be “hidden” in lle\lp» and one term 
ho'[In*|/? that eventually will produce the factor h'*!”2, Thus, the streamline 
diffusion method has “extra stability” in the streamline direction as compared 
with the standard Galerkin method where the form b(v,v) dominates only the 
Le-norms ||v||? and |v|? and where the quantity \Inpil appears in the proof of 
the error estimate (cf the proof of Theorem 9.1). G 


The error estimate (9.30) for the streamline diffusion method (9.28) states 
that 


|Ju-ublissChr*!? fullest, 


{@up—up)||<Ch]lu)fp 


Thus, the Ly-error is half a power of h from being optimal (cf (9. 16a)), while 
the Lg-error of the derivative in the streamline direction is in fact optimal. 
These estimates indicate that the streamline diffusion method (9.28) should 
be somewhat better than the standard Galerkin method (9.22) if the exact 
solution is smooth, but they do not explain the dramatic impruvement one 
actually finds when the exact solution is non-smooth. The fact that the 
streamline diffusion method also performs well in this latter more difficult case 
is related to the fact that in this method effects are propagated approximately 
as in the continuous problem, ie, essentially along the characteristics. One 
can prove for the streamline diffusion method (see [JNP]) that the effect of 
a source at a certain point PeQ decays at least as rapidly as exp (-d/CVh) 
where d is the distance to P in directions perpendicular to the characteristics 
(“crosswind” directions), and like exp (—d/Ch) in the direction opposite to 
the characteristics (“upwind” direction), In particular, this means that the 
effect of eg a jump in the exact solution across a characteristic will be limited 
to a narrow region around the characteristic of width at most O(Vh) (in certain 
cases the width is improved to 0(h™), sce [JSW]). On the other hand, in the 
standard Galerkin method effects may propagate in the crosswind and even 
in the upwind direction with little damping (sce the discussion for the 
one-dimensional problem (9.19) in Example 9.1). 
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Remark 9.3 Note that for the cuntinuous problem (9.13) with g= for 
simplicity, we have the following stability estimate: 


lul+ [Jul [lal] sCilei]- 


This estimate follows by multiplying (9.13) by u which gives control of |u| and 
|lul], and the control of ug then follows through the equation ug=f—u. In the 
streamline diffusion method (9.28) the corresponding stability estimate, 
obtained by taking v=u and using Lemma 9.2, reads Hed) pC liei] or 


|u*(-+/[u'l|-+ Vi [lupl| = CilEil- 


This estimate is a weaker variant of the above estimate for the continuous 
problem with less control of the streamline derivative. In the discrete case we 
have no equation analogous to ug=f—u and hence control of \luphl does not 
follow from control of ||u4{|. Instead, in the streamline diffusion method, 
partial control of \Jup| is explicitly built in through the modified test function 
v+hyg. Notice also that in the standard Galerkin method the stability estimate 
reads |u"|+//u"l|<Cilf||, with no extra contro! of {tupll. Here one can only 
guarantee ||ug||ssCh~"Ju"||<Ch-"LF)| through an inverse estimate, cf 
(7.47). 0 


9.7.2 The streamline diffusion method with ¢>0 


Let us start from the stationary problem (9.12) with g=0 and h>e>0. 
Multiplying the equation —e Aut+ug+u=f by the test function v+dvg. where 
ve HQ), and integrating, we get 


—ed( Au,va) + e( Vu, Vv) + (ugtu.v+dve)=(f,v+ dvp), 


where the term —e( Au, v) has been integrated by parts. Here 6 is a positive 
parameter to be specified below. To formulate a discrete analogue of this 
relation by replacing u by ule Vy and restricting v to Vj, we have to give a 
suitable meaning to the term (Au, vg). since this expression is not directly 
well-defined for u*, ve Vhe The correct definition turns out to be simply the 
following in this case: 


(9.31) (Au, vg)= 2 J Aulygdx, 

Kel, k 
ie, we just sum the integrals over the interior of each triangle K where Aub 
and vg are well-defined. We now formulate the following streamline diffusion 


method for (9.12): Find ube Vp such that 
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(9.32) e( Vuh, Vv) —e6( Aug, vp) +(up+uhv+ Svg)=(f.v+ dye) 
we Vn, 


where 6=Ch if e<h with C>0 sufficiently small (see Remark 9.4), and d=0 
if ezh. Clearly, this is a consistent foimulation since (9.32) is satisfied with 
uy, replaced by u as we noted above. Vhe error estimate (9.30) and the 
localization results for (9.28) can be extended to the method (9.32) with e<h. 
To sum up, the method (9.32) is an answer to the problem of constructing 
a higher-order accurate method for (9.12) with good stability properties (cf 
Remaik 9.6). 


Remark 9.4 Let us give a proof of the basic stability estimate for (9.32) in 
the case e<h, which proves in particular that the presence of the term 
= c8( Aur, vp) does not degrade the extra stability introduced by the term 
(ug, vg) if the constant C is small enough. By the inverse estimate (7.51) we 
have for ve V, 


led Av. vel sjell J v2 Sed ?6}Iupll2 
so that with B,(. . .) denoting the bilinear form associated with (9.32). 
Bev, vie dell VoiP+ Iv? +0 5esc*h2) dllval2, for vey. 
Thus, if C is so small that 
e8C?h-2=C’Cehh aCe], 
then 
B.(y, ve teel Vvi|l2+ df] vp}? + {I¥tl?) for ve Va, 


which proves the desired stability result. 


Remark 9.5 Af {{y} is not quasi-uniform or |B| is variable, then in (9.32) we 


choose 6=Chy/|B| on KeTh if e<hx|f!, where hx is the diameter of K, and 
5=0 if e=hxIB]. 


Remark 9.6 As noted above the streamline diffusion method will capture a 
jump discontinuity of the exact solution in a thin numerical layer. However, 
within this numerical layer the approximale sulution may exhibit over- and 
under-shoots (cf Fig 9.3). Recently, in [IIFM] and [HM2], a modified 
streamline diffusion method with improved shock-capturing properties (re- 
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duced over- and under-shoots) was introduced. In this method the test 
functions are modified as follows: 

v+6B- Vvt5B- Vv, 
where 


ie Bis the projection of B onto ¥ u, Since B depends on the unknown discrete 
solution uh, this leads to a non-linear mcthod even though the underlying 
problem is linear, Further, as above 5=O(h/p|) and also 5=O(h/B)). For 
numerical results see Examples 9.2, 9.6 and 13.9. The problem of theoretically 
explaining the improved shock-capturing of the modified method is con- 
sidered in [Sz]. 0 


Remark 9.7 The streamline diffusion method for (9.12) and (9.13) is basically 
obtained by multiplication with test functions of the form vt+hvg where 
veVy. This means that the test functions belong fo a space which is different 
from the space of trial functions Vy where the discrete solution u) is sought 

Such a method, where the test functions are different trom the trial functions, 
is sometimes called a Petrov-Galerkin method. Note that in a standard 
Galerkin method the spaces of trial and test functions are the same (modulo 
boundary conditions). 


Example 9.2. Consider the convection-diffusion problem 
—eAutp: Vu=0 in 2, 
u=gonT), 
du 


—-00n [2, 
du 5 


where £= 1073, B=(cos 10°, sin 10°), Q={xeR?: Q<x;<1} is the unit square, 
Py={xeP: xg=1)} and Py=f 2 where [ is the boundary of Q. Further, g=1 
for 1/2<x <1, x1=0. and g=0 if x2<1/2 or xy=1. In Fig 9.7a we give the 
approximate solution of this problem obtained using the streamline diffusion 
method on the indicated mesh with 6=h and using piecewise linear basis 
functions. In Fig 9.7b we give the corresponding result using a small amount 
of shock-capturing (=0.15 h). G 


Problems 
9.5 Formulate the streamline diffusion method for the one dimensional 


problem (9.19) with e>0 as well as c=0. Determine the corresponding 
difference schemes in the case of piecewise linear trial functions and 
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Fig 9.7 Solution graph and level curves for streamline diffusion method for problem 
in Example 9.2 (a) without and (b) with shock-capturing 


a uniform partition. Make a computational comparison with the 
method (9.20) (cf Fig 9.6 where the thin curve gives the streamline 
diffusion solution in the case e=0.01, h=I/I1 and 8=2h/3). O 


9.6 Prove the error estimate (9.30) for the method (9.32) with e<h. O 


9.7 Generalize the streamline diffusion method (9.32) to the variable 
coefficient problem of Problem 9.4. Hint: Use eg the test function 
v+édiv(pv). O 
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9.8 The discontinuous Galerkin method 


Galerkin methods using continuous trial functions will lead to globally coupled 
systems of linear equations, ie, systems where a change in data at one node 
will (at least in principle) affect the solution at all nodes. This is natural in 
the elliptic problem (9.12) with c>O, but not so for the purely hyperbolic 
problem (9.13) with e=0. In this latter case it would be more natural to be 
able to solve the linear system by successive elimination starting at the inflow 
boundary [ 

We will now consider a finite element method for the reduced problem 
(9.13) which permits such a solution procedure and which has stability and 
convergence properties similar to that of the streamline diffusion method. This 
method, the discontinuous Galerkin method, may be viewed as a general- 
ization of the method with the same name in Chapter 8. It is based on using 
the following finite element space: 


W,={veLo(Q): vixeP(K) WKeTn}, 


that is, the space of piecewise polyomials of degree r#0 with no continuity 
requirements across interelement boundaries. 

To define this method let us tirst introduce some notation. For KeTp we 
split the boundary SK of the triangle K into an inflow part K_ and an outflow 
part SK. defined by 

8K_={xedK: n(x) -B<0}, 
8K.4={xedK: n(x) B20}, 


where n(x) is the outward unit normal to dK at xeK, (cf Fig 9.8). 


Fig 9.8 


Further, suppose S is a side common to two triangles K and K’ (cf Fig 9.8) 
and consider a function ve W), which may have a jump discontinuity across 
S. We define the left and right hand limits v_ and v. by 
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v.(x)= lim v(x+sB), 
s+ 0- 

Va(x)= lim v(x+sf), 
s— Or 


for xeS and we also define the jump [v] across $ by 
[vJ=va-v-. 


The discontinuous Galerkin method for (9.13) can now be formulated as 
seeking a function ube W,, according to the following rule: For KcTy, given 
u" on 8K~ find usu ly eP,(K) such that 


(9.33) (ull +a v)_- f uby, n-Bds=(fy)k- July, n-Bds, 
OK_ oK_ 


VveP,(K), 
where 


(w,v)K= wv dx, u" =e on TL. 
K 


To see that this problem admits a unique solution, note that (9.33) is nothing 
but the standard Galerkin method (9.22) with weakly imposed boundary 
conditions in the case of just one element. Thus, if ul is given on OK~ we 
know that ul, is uniquely determined by (9.33). Now, we can start to deter- 
mine u? on the triangles K with 8K_cI_ since then u'=g is given. This will 
then define u® on the triangles K next to, and we may continue this process 
until u™ has been determined in the whole domain (cf Fig 9.9 where the order 
in which u? may be calculated is indicated). 


To write (9.33) in more compact form suitable for analysis, note that (9.33) 
can be written 
Bx(u",v)=(f,¥)x VveP,(K). 


where 
Bu(w.v)=(watw.v)x—_f [wlyen- B ds. 
oK_ 


The discontinuous Galerkin method can now be formulated: Find ue W,, such 
that 
(9.34) B(u,v)=(f,v) YveWn, 


where 
Bwv)= = Bx(w,y), 


KeT,, 


and ul=p on [_. Clearly the exact solution u satisfies the equation 
B(u,v)=(f.v), YveWh (note that [u]n- B=0), and thus we have the error 
equation 


(9,35) B(u—ul,v)=0 YveWh. 
Before analyzing the method (9.34) in some detail let us consider the following 
examples. 
Example 9.3 Let us consider the one-dimensional analogue of (9.13), ie, the 
problem 
uxytusf for 0<x<l, 
u(0)=g. 
Let U=xo<x1 . . . <xn=l be a subdivision of I1=(0, 1) into subintervals 
I}=(xj, xj+1). The method (9.33) reads in this case (ef. (8.36)): For j=0, 1, 
- N=], given ul(x;)_ find ub=uh); eP,(1j) such thar 
f (ugtubyy dx+(u'(x))+—-ul(x;)-)v(q)e=ffvdx Wwe Py(l)). 
I 1 


f ' 


(9.36) 


where v(x)+= lim v(x+y) and u!(x9)-=g. In particular, for r=0 in which case 
y Oe 


ub is piecewise constant, we get the following method: Find Uj=u(xj)- such 
that 
Wet aig | ftux  j=0, ..,N-=1, 
hj hy i, 


Uo=s, 
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where hj+xj.1—x,, which is a simple finite difference method for (9.36). 
namely the upwind or backward Euler method. 


Example 9.4 Letus consider the method (9.33) in the case r=0 and to simplify 
further, let us assume that f=0 and also that the u-term is not present so that 
we simply have the problem ug=0 in Q, u=g on [_. The discontinuous 
Galerkin method then reads: For KeTp, given u® on JK find the constant 
Ux=u"|, such that 


— f Uk n-Bds=— fubn-p ds, 

OK_ OK. 

ie 

(9.37) Ux = fuln-B ds/ fn-Bds. 
OK_ oK 


In other words, for each K the value Ux is obtained as a weighted average 
of the values of u" on adjoining elements with sides on OK_. As an example, 
using quadratic clements in the following configuration 


and assuming that 8;>0, we find that 


8 Ba 
Uy= —— U) + Up, 
BitBr |” Bi+Bo * 
where Uj=Ux,. Again this corresponds to a simple difference scheme for the 
equation ug=0 (in fact this is a usual upwind difference scheme for this 
equation if we relate the value Uj to the midpoint of each Kj). O 


Let us now prove a stability inequality for the discontinuous Galerkin 
method (9.34) using the norm | -|s defined by 
1 


\iR=llvll2 + == $f [vPin- plds + J. fvin-B ds. 
2 ok. 26 
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Lemma 9.3 For any piecewise smooth function v we have 


By, Y=Win3 fv 2In- Bi ds. 


Proof By Green’s formula 
2(yp, = J vin-Bds— f viln- Bids, 
ak, 3k 
and thus 
2Biv,v)= Ef f vin-Bds— f v3|n-Blds 
K ak, aK. 
+2 iS. (v4—v—)valn- Bids} +2]}¥([2. 
Kk 
Since every side of 9K, coincides with a side of 9K“ for an adjoining element 


K’, except if OK4cIT4, and similarly with + and — reversed, we have 


x fvin-Bds== f v2[n-plds 
K 3k, K 3K 


+ fv2n-Bds— f v2|n-Blds, 
r, r 


and consequently 


ABiv,v)= U{ f (v>—-2w_v4+v?)|n- Bids) + 
K  aK_ 
+ fv2n-Bds— f v2|n-Bids+2|Iv\|2, 
r, ion 


which proves the desired result. O 


From Lemma 9.3 we obtain in the usual way existence and uniqueness of 
a solution to the discontinuous Galerkin scheme (9.34), and it is also possible 
to derive an error estimate which proves O(h‘) convergence in the | - |g-norm. 
However, this estimate is not the best possible. One can prove that if 8=Ch 
for some suitable constant C. then for veWh 


(9.38) BUv.v+ dvp)>ClvIIB-L v2|n-Blds), 


where 


Vj =ivem = Ulli, Nwle=Goem de. 
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Using this improved stability, it is possible to prove the following error 
estimate 
9.39) |lu-ul|psCh'*"*IIullrs- 


In the case r=0 we have ||vily=|v|p for ve Wn since here vg=0 on each K. Thus, 
for r=0 the stability of Lemma 9.3 should be sutfictent to obtain O(h!) 
convergence. Let us prove that this is in fact the case. 


Theorem 9.3 There is a constant C such that if u® satisfies (9.34) with r=0 
and u satisfies (9.13), then 


1 
ju-uPlg <Ch?|lulle. 


Proof Let teW, be the interpolant of u defined by letting bq be the mean 
value of u ver K for each KeT},, and Ict us write as usual y*=u—d". Applying 
Lemma 9.3 with v=e=u—u" and noting that c-=0 on T-, we get, using also 
the error equation (9.35) with v=un—Uh 


lel =B(e, ce) = Ble, v-u")+B(e, au") 
=Béen®) = Z (Cop, Wf Lelnta-B ds} +(e 
K . 
Now, es=(u—u")p=up on each K, since u" is piecewise constant, and thus by 
Cauchy's inequality 
(9.40) jel<aail Lint +Hlel! intl 
+(= f [ePin- Bids)!?-(2_f [nt Pla Bids)’. 
KaK_ K aK_ 


It is easy to realize that if ueC(Q) so that ae i=1, 2, is bounded on &, then 
i 


Max nh()|<Ch, 

xeQ 
and therefore, since the Jength of 6K- is 0(h) and the number of elements 
is 0(h72), we have 


= J nhP|n- lds = Ch3<Ch. 
K dK. K 


Thus from (9.40) we conclude, hiding terms as usual on the left hand side, 
that 

lel} Clluglih+Ch?+Ch<Ch, 
or 

lelps Chl?, 
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where the constant C depends on max {/D@u(x)|: |al=1, xe}. This proves 
the desired error estimate, modulo the fact that we have used a somewhat 
stronger norm on the exact solution u than stated in the theorem ft is in fact 


casy to sce that the norm ||ul|, is sufficient and this is left to the interested 
reader. O 


Remark 9.8 Suppose we stop the calculation of u* when ub has been 
determined on a subset Ty of Ty, eg on the triangles Ki,. ., Ki in Fig 9.10, 
and let Q’ be the union of the triangles in Th. Then clearly the error estimate 
(9.39) holds with Q replaced by '. In particular this means that we obtain 
error estimates in the weighted Lo-norm 


Jv2n-Bds, 
T3 


extended over the outflow boundary I, of each subdomain Q’, eg along the 


LRT 
[NPR 


RE 


Fig 9.10 


We now turn to the time-dependent mode! problem (9.15) with €=0. ie, 
the problem 


uptuy=f in 2=3K1=(Q, 1)x(0, T), 
(9.41) u(x,0)=uo(x) for xeJ, 
u(0,t)=a(t) for (el. 


As already noted this problem has the same form as the stationary model 
problem (9.13) with c=0. Thus, we can apply the discontinuous Galerkin 
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method using a triangulation of Q=J x1. ie, a triangulation in space-time. It 
is natural to consider a triangulation where the elements are organized in time 
as in the following example (note that triangulations in adjoining strips do not 
necessarily have to match): 


a 
4 
18 19 20 
15 16 7 
12 13 14 
8 9 10 11 
4 5 6 7 
1 2 3 
x 


Fig 9.11 


It is then possible to compute the discrete solution u! successively on one strip 
after another starting for each strip on the left and moving triangle by triangle 
to the right (the order in which uh may be computed is indicated in Fig 9.11). 

Conventional schemes for (9.41) are based on using separate discretizations 
in space and time. First a semidiscrete problem (an initial value problem for 
a linear system of ordinary differential equations) is obtained by discretizing 
the space variable using fmite elements or finite differences, and then a 
difference method is used to discretize in time. Ifowever, for the problem 
(9.41) there is really no reason to distinguish between x and t and it seems 
most natural to use space-time elements. 

‘lo sum up, the discontinuous Galerkin method has theoretical stability and 
convergence properties similar to those of the streamline diffusion method. 
In practice it turns out that when applied to eg (9.41) the discontinuous 
Galerkin method perfurms somewhat better than the streamline diffusion 
method. In fact, already for k=1 the discontinuous Galerkin for (9.41) 
performs remarkably well and we know of no (Jinear) finite difference method 
that is better (cf Problems 9.12, 9.15 and Example 9.5 below). 


Problems 

9.8 Determine the difference scheme corresponding to the discontinuous 
Galerkin method with k=! for the one-dimensional problem of 
Example 9.1. 
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9.9 


9.10 


9.1L 


Evaluate Ux in (9.37) in the case of triangular elements in a general 
configuration. Distinguish between the cases when 3K — consists of one 
and two sides. 


Let u® be the solution of the discontinous Galerkin method with k= 1 
for the problem 


ou_,. _ 
ago! in Q, u=g on! _, 


on the following triangulation: 


(0,2) (0,0)  *1 = (2,0) 


Suppose that u? on 3K) U8K2 is given by 


(0.49) ue, 0)=a1+ ix), —2<x)<0, 
u(x), 0)=a2+ Box), O0<x)<2. 


Determine u" on aK‘, ie. assuming that w(x), =544 Bax for 
—1<x)<1, determine a4 and By in terms of «4, Bi, m2 and Bo. Hint: 
h 
Prove first that Bao on K! and K? and then prove that 
x2 


1 
J uh Ga Dy dx; = 
“I 


a0 .Ov dx, for v=1, x;. 


ie prove that uh (x), 1), -L<x2<1. is the L>-projection onto the space 
of jinear functions on (—1, 1) of the piecewise linear function 
u" (x1, 0) given by (9.42) for -1<x,<1. 


Extend the analysis of the previous problem to the more general 
equation ug=0 with 6 chosen so that the inflow boundary 
P_=8K | U3K2. Based on this analysis make an interpretation of the 
discontinuous Galerkin method for the problem ug=0 as a method 
composed of two stcps: exact transport and projection. 
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9.12 Consider the discontinuous Galerkin method with k=1 for the 


problem 
du du 
—+y—=0, x, ERR, 
a’ dx me) 
u(x, 0)=g(x), xeR, 


with O<y=1. Use a uniform uiangulation of RXR* of the form: 


1,7) 


where xj=ih, tp=mh, i=0, £1, #2,...,n=0, +1, #2,... . Suppose 
we represent the discrete solution u h (. . ty) on the interval (xj-1. x) 
as 


u(x, t,)=UR+ t (x—xi) Vi, xe(xj-1, Xi). 


where sex Prove, using eg the result of Problem 9.11, that 


the discontinuous Galerkin method in this case is equivalent to the 
following explicit difference scheme: 


u} t -hy ust 
n | =U-¥) 2 n-1 
vy 6y = 1-2y-2y vi 
1 n- 
1 3U-y) Unt 
+Y P 


~6(1-y) —346y—2y? | | VIF 


connecting the values of the vector (U, V) associated with the 
O-marked points in Fig 9.12. 


9.13 Prove (9.38) and (9.39). 
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9.9 The streamline diffusion method for time- 
dependent convection-diffusion problems 


Let us now consider the time-dependent model problem (9.14) with e>0, ie 
the problem 


Uy FUy— Ey =f in Q=JxI, 
(9.43) u(x, 0)=uo(x) xeJ, 
u(x, )=0 x=0, 1, tel, 


where for simplicity we consider the case of zcro boundary data. With e>0 
we cannot apply the discontinuous Galerkin method otf the previous section 
to this problem; to handle the diffusion term —euyx, the trial functions should 
be continuous in the space variable. On the other hand, to be able to compute 
the discrete solution successively on one time level after the other, it is natural. 
if we insist on using space-time elements. to use trial functions which are 
discontinuous in time. Thus, we are led to consider a method where the trial 
functions are continuous in space and discontinuous in time based on a 
triangulation of space-time with the elements organized in strips in time eg 
as in Fig 9.11 or 9.13. 

To define such a method let O=ty<ty<. . .<ty=T be a subdivision of the 
time interval I=(0, T) and introduce the strips $, defined by 


Su={(x, 0): OMSK, typ Sti}, 


for n=]. ..., N. Further, for each n let V" be a finite element subspace of 
H!(S,), based on a triangulation of the strip S, with elements of size h>e, 
and Jet V"={veV": v(x, =O for x=0, 1}. (Notice that it is not necessary that 
(triangulations of different strips “fit’’ across the discrete time levels, cf Fig 
9.13). 


E> 


t+ 
sa 


Fig 9.13 
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If we now apply the streamline diffusion method (9.32) successively on each 
strip S, to the problem (9.43), imposing the initial value at (=(,—-1 weakly and 
the boundary conditions strongly. we obtain the following method: For n=1, 
..., N, find ue ¥" such that 


(ut +ul, v+8(vity,))8+<cul, ve2n7! 
(9.44) +e(uy, vx)" e8(uy,, vitv,)” 
Ve wa, 
where 6=Ch with C sufficiently smail for e<h, 6=0 for sh, 


=(f, vtO(vpt yy) <ul), vp! 


1 
(w, v)8= [wv dxdt, <w, v>"= fw(x, ty)v(x, tn)dx, 
8, 4 


v(x, t)= him v(x, tts), v_(x, th= lim v(x, t+s), 
ere iad 9.15 
and u? Sugsinitial data, and eS(u},, vi +vx)" is defined in a way analogous 
to (9.31). 
For each n (9.44) is equivaient to a linear system of equations and thus we 
have an implicit scheme (cf Problem 9.14). Further, since the space Vis 
independently defined on each strip with no continuity requirements from one 
strip to the other, the solution u? will in general have jumps across the discrete 
time levels ty. In the case eh and a suitably chosen triangulation of (0, 1)x(0, 
T) the method (9.44) would coincide with the discontinuous Galerkin method 
for parabolic problems presented in Section 8.4. 


Problems 
9.14 Consider the streamline diffusion method (9.44) with e=0 and 
J=(—, ©) on a regular space-time triangulation of the type 


t 


ot 
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Show that the method in this case is equivalent to the difference 
scheme: 
12), 452) pnt ( 1-1 23 _ 101 #42) up} 
=+fh+ 202] Ut} 4 | sacl- = - a SUM 
[I+da-d jure 6 ig 36% 368) 


\ 
+(2rveds i433] Ue +(hos- yee 2] un} 
ra 


3 4°] 6 6 36° (36 ted 
142,42 Jum af dye24 Lalu 
+[dedieds jum (a 9° 36 | ml 


21434 1a)ur hot oa . -(3+2) , 
+(2 hi dalus+(h 37 12} Viti 36 7 364] Se 


where A= At/Ax and UT*!-uMjh, (n+ 1k). 


Compare computationally the discontinuous Galerkin and the stream- 
line diffusion method for the problem (9.41) with the uniform 
triangulation of Problem 9.14. Consider the following cases for 
example: 

(i) u is smooth, 

(ii) u is piecewise smooth with a jump discontinuity, 

(iii) ug is a delta function, g=0. 


In the same cases also make a computational comparison with the 
following difference methods: 


UM =(1-\Up +h N15 (upwind 
method) 
f 2 
aiafAg | ys 2 5 (2-8) a (Lax- 
ui -(EeS} upc WU, 2 2) '*' Wendroff) 


(i-d) Unt} + +8)UP =(14A)UT +14) UP (box scheme) 
is 12,82 BY), 
Veg Uiatle a? a) ert 
cf 2 20 BV pra f{ LAG MMV ayn a 
(2 M+ Uy+ & st ere | (Shasta) 
Up EV i= 2Vje1-2Vj Vi -1) 
uptt= Ur! RUB + AUP | (leap-frog) 
where U? approximates u(j Ax, nAt) and era 
7 app j i 
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Example 9.5 In Fig 9.14 below we give the results after 49 time steps obtained 
by applying the discontinuous Galerkin method of Problem 9.12 and the 
difference methods of Problem 9.15 to (9.41) with A=0.56 and a step function 
as initial data. 


Upwing sneste Bou scheme 


lan-Wencroft Disc. Galerkin Leap frog 


STEP 49 


Fig 9.14 Comparison of the discontinuous Galerkin method and some difference 
schemes for a convection problem 


Example 9.6 Consider the convection problem in two space dimensions 
+B V=0 for xe Q, t>0, 
t 


u(x,0)=ug(x) for xeQ, 
where Q=(-1,1)(—1,1) and 6 is the velocity field 
B(x)=r(-sin 8, cos6), x= r(cos6, sin 8), 


corresponding to a clockwise rotation around the origin. In Fig 9.15a, b we 
give the initial condition up and the corresponding approximate solution 
u+(-,t) after rotation 360° obtained by the direct extension of the streamline 
diffusion method of Section 9.9. In Fig 9.16a, b we give the initial condition 
ug and the corresponding approximale solution after rotation 90° obtained by 
the streamline diffusion method with a certain amount of shock-capturing. 
Note that in this experiment the space-time finite element mesh was adaptively 
moditied automatically in each time step using the technique indicated in 
Section 4.6. The time step was chosen to be qual to the minimal space step 
on each computational ‘slab’ $,=QxTy, [p=(tn-1, te). O 
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LAMA 


OAPAViV/AVAVAVA 


BZ, 


a, Grapn and level curves for initial condi- b. Same as in a. for finite element solution 
tion together with inital mesh. afler rotation 360°. 


Fig 9.15 Streamtine diffusion method without shock-capturing for convection problem 
with fairly smooth inital condition 
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7 


a. Graph of initial condition and initial meso. 


tion together wilh mesh after rotation 90°. 


Fig 9.16 Streamline diffusion method with shock-capturing for convection problem 
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with non-smooth minal conition 


b. Graph and level curves of finite element soiu- 


9.10 Friedrichs’ systems 
9.10.1 The continuous problem 


In this section we will briefly indicate how to extend the streamline diffusion 

and the discontinuous Galerkin methods to the case of linear first order 

hyperbolic systems of the form (9.8), or Friedrichs’ systems (cf [Le]). We then 

consider first the following problem in a domain QcR# with boundary I: 
dou 


(9.45a) Lu = © Aj—+Ku=F in®. 
rer Xi 


(9.456) (M-D)u=0 on I, 


Here the Aj, K and M are given mXm matrices depending on x, u is an 
m-vector and 


d 
D= £ njAj, 
i-1 
where (nj, . .. ng) is the outward unit normal to T. We assume that the 
matrices A, are symmetric (with real clements) and that 
(9.46a) M+M*20 onl, 
dgAS is 
(9.460) K+K*- 394 2 in Q, 
1=1 OX} 


(9.46c) Ker (D~M)+Ker(D+M)=R™ on I, 


where o is a nonnegative constant, E* denotes the transpose of a matrix E 
and I is the identity matrix. For matrices F and F we have written E2F to 
mean that E-F is positive semi-definite, and Ker E={SeR™: E§=0). Under 
the conditions (9.46) (with o>0) and some smoothness assumptions onc can 
prove that if Fe[L2(92)]", then (9.45) admits a unique solution, (see [F]). 

Many problems in mechanics and physics can be written in the form (9.45). 
Let us here mention only two special cases. 


Example 9.7 The reduced problem (9.13) has the form (9.45) with m=1, 
A;=6,, K=1, D=B6-n and M=|D|. O 


Example 9.8 The initial-boundary value problem for the wave equation 


(9.474 ee 
) 3x4 x3 


f O<xy<]. Ixal<tl. 
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47>) w(x, “New, Y=0 O<m<I. 
(9.47c) WO, x)= (0,x)=0 fale, 
4 


where x, is a time variable, can be written in the form (9.45) with 


Q=(0,1)x(-1, 1), v=(uy, U2), ue pee 
d. 


10 0-1 0 0 
w[o tf ae[ 7 allo of, 


-| | for x}=0 or x)=1. 


. F=(f. 0) and 


0 


2 


-1 
1 0 for x2=1, 
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m-| =] 0 for mon M=| 


Note that the boundary conditions (9.47b) translate into the conditions 


w= St =o for |xoi—1, O<x)<1, which correspond to the conditions 


xX] 
2 0 uy 
#2 0 Ww 
Let us now generalize the standard Galerkin method, the streamhne 


diffusion method and the discontinuous Galerkin method to the Friedrichs’ 
system (9.45). We will use the following notation 


(M-D)u= =0 form=tl.0<xj<l. O 


(v, w=Jv-w dx, [Iva v7, 
Q 
<v, wo=lv-w ds, |vyja<y. v>!?, 
r 


By Green's formula we have 
(Ly, w)=<Dv, w>+(v, L*w), 


where 


so that in particular, 


(9.48) (Lv. v= H(LAL*yy, v5 <Dv, v>, 
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where by (9.46b), 
L+L* 2 al. 


We also introduce the spaces Vp=[Vnj™ and Wh=[Wa]™ with Va and Wh 
defined in Sections 9.4 and 9.8 above. We can now formulate our methods 
for (9.45). For the error estimates we also assume that M+M*2cl or T for 
some c>() (cf |JNP]). 


9.10.2 The standard Galerkin method 


Find we Vp, such that 
(Lu’, ve} <(M—D)u", v>=(F,v) — WveVp. 


Choosing here v=u" and using Green's formula (9.48) we obtain (with o>0) 
the stability estimate |lu"||+|u>|<C||F||, from which error estimates of the 
form |(u—u||<Ch'{ul|;+1 can be derived in the usual way. 


9.10.3 The streamline diffusion method 


Find ue V;, such that 
(9.49) (Luh, vt hLov) +5 <(M-D)u", v>=(F, v+hLov) Wwe Vn. 


where 


C) 


d 
eee 


Again choosing v=ul, we obtain (for h sufficiently small) the stability estimate 
(with again 6>0) 
V hf[Lou®||+ijuh||+<Mul, ubsl2<C||FI, 


from which we obtain the error estimate |[u—up|| = Ch'*!2|\ul|,,.), arguing as 
in the proof of Theorem 9.2. 


9.10.4 The discontinuous Galerkin method 
To formulate this method we need additional notation. For KeT, we write 


(, w)k=fvew dx, <v, w>=J vw ds, 
K OK 
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[wl =wrt—wett, 


witl(x)=lim w(y), we*'=lim w(y), xedK, 


yon yor 
yeK yeK 
d 
De= E Ani, 


where nk=(nk) js the outward unit normal to K, and where we set w°*"(x)=0 
for xel. Further, for each KET), we introduce matrices Mx defined on dK 
and satisfying, for K, K’€Th. 


Me=Mx: on 8KNEK’, 
Mx+My20 on 8K, Mx=M on Fr 


Here a possible choice on interior edges is Mx~Al with A>0. 
The discontinuous Galerkin method for (9.45) can now be formulated: Find 
ube Wy, such that 


(9.50) z (Lu. v)k-+5<(Mx—Dx)[u"). voK}=(E. vy). Wve Wa, 


For this method we again have error estimates of the form 


i 
|fu—ut||<Ch’ 2, 


Note that in the scalar case with m=1, choosing Mx=|B- nk} in (9.50), where 
B=(Bi), Bi=A., gives the discontinuous Galerkin method (9.34). 

The formulation (9.45) also includes time-dependent problems if we 
consider x; to be a time variable and choose eg Ay=I, Q=(0,T)x Q’ and M=1 
for x;=0, T. In this case we should modify the streamline diffusion method 
(9.49) following the pattern of Section 9.9. On the other hand, the discon- 
tinuous Galerkin (9.50) directly applies also to the time-dependent case, cf 
the following example: 


Example 9.9 Let us again consider the one-dimensional wave equation (9.47) 
written on system form according to Example 9.8, with now f=0 and non-zero 
initial conditions for x, =0: 


(951a) 2¢@4aS4=0 xa, O<xy<1, 
38x) 8x2 

(9.51b) = (M-D)u=0 xg=41, Ox) <1, 

(9.5lc) uO, x2)=u(0. x2) Ixal<1, 
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where. 


_| 0 =-1 _| 2 ¥1 
a-[_! ‘Jame 2 1 toraoasta 


Changing dependent variables through the orthogonal transformation 


1 a! | 
=Su, S=——= e 
a Al } 


u=S*q, 


we can write (9.51) as 


(9.52a) 98 4A SB_g, 


dx, 3x2 
(9.52b) — (M-D)@=0 X2=41, O<x) <1, 
(9.52c) (0, x2)=—p0, x2) Ix]<1, 
where 


Aqsas'-| 1 0). 


0-1 
M-D=SMS*—sps* | for x=-1, 
“| 0 0 
me? | for x2=1, 


Note that (9.52a) is an uncoupled system of two scalar advection equations, 
and that the coupling in (9.52) only occurs through the boundary conditions 

Suppose we now apply the discontinuous Galekin method to (9.52) with 
Mk=Mx given, on interior edges, by 

K,,K 
Z +nS} 0 m* 0 
(9.53) vie | inert =| 
0 |n Koy «| 0 mk x 

We then obtain a discrete analogue of (9.52) consisting of two discontinuous 
Galerkin discretizations of scalar advection problems which are coupled only 
through the boundary conditions (9.52b). Alternatively we may consider a 


direct application of the discontinuous Galerkin method to the coupled 
problem (9.51) with the Mx given by 


m¥+mk m}—me 


m&—mkm&emk 


, 


Me=ssihs-| 
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with the notation of (9.53). In fact these two approaches are equivalent and 
would produce the same numerical results since if v-S*y, we have 
(Au, v)e=(AS*@, S*V)K=(SAS*G, W)K=(AG, Wk, 

and corresponding relations hold for the remaining terms. Thus, we conclude 
that the discontinuous Galerkin method applied to the coupled problem (9.51) 
with proper choice of the Mx “automatically diagonalizes”’ the system (9.51a). 
The same holds for the streamline diffusion method. ‘This is of interest eg 
when analyzing the nature of propagation of effects in the discrete analogues 
of the coupled problem (9.51). O 


Remark 9.9 ‘The streamline diffusion method for stationary problems was 
introduced in [HB1|, [HB2]. The mathematical analysis of the method was 
started in [JN| and was continued, with extensions to time-dependent 
problems. in [J2], [Na], [JNP]. The method has also recently been extended 
with good results to incompressible and compressible flow problems. see 
[BH], [HFM], [HMM], (HM2], [HM2], [JSal], [J4]. [JSz1]. [JSz2], [Sz], and 
Chapter 13. For combinations of finite clement methods and methods of 
characteristics, see [DR]. [M]. [BPHL]. The discontinuous Galerkin method 
was first analyzed in [LRal, see also [JP2]. 0 


Problems 

9.16 Apply the discontinuous Galerkin method to (9.51) or (9.52) to 
compute approximate solutions of the wave equation (9.47). Test the 
performance of the method with different degrees of regularity of 
initial data as eg in Problem 9.15, Also compare with the results of 
other methods for the wave cquation such as cg those presented in 
Section 9.11 below 


9.17 Prove error estimates for the standard Galerkin and the streamline 
diffusion methods for (9.45) with o>0 and M+M*2cl, c>0, 


9.11 Second order hyperbolic problems 


Previously we have considered first order hyperbolic problems. We now turn 
to second order problems. A typical example is the wave equation: 


(9.54a) i-Au-f in QxI, 
(9.54b) u=0 on Ix], 
(9.54c) u(x, 0)=ug(x), u(x, 0)=u1(x), for xeQ, 
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where 2 is a bounded domain in R* with boundary T=(0,T), i= = and 


ms 
u= a . This equation models eg a vibrating membrane with given deflection 


ug and initial velucily uy. Following Example 9.8 it is possible to rewrite (9.54) 
as a first order hyperbolic system and in principle we may then apply the 
methods of the previous section. However, with this approach we introduce 
new unknowns which result in an increase in the number of variables in the 
discrete problems. Thus, there are good reasons to try to keep the formulation 
(9.54) involving second derivatives and a scalar unknown, However, with this 
formulation it docs not sccm to be known how to construct methods combining 
good stability with high accuracy. In particular, we cannot extend the 
streamline diffusion and discontinuous Galerkin methods to the wave 
equation (9.54) because of the presence of second order differential operators. 
Anyway let us here describe some methods tor (¥.54) that are currently used. 
The wave equation (9.54) can be given the following variational formulation 
using the notation of Section 8.2: Find u(t)e V=H)(Q), such that for tel. 


(9.5$a) (G(1), v)+ a(u(t), yy=({(L). v) VWvev, 
(9.55b) —-u(0)=up, u(0)=uy. 


The basic encrgy estimate for (9.55) is obtaincd by choosing v=u in (9.55a) 
which gives with f=0 for simplicity. 


ld a,1 4 2 
Sai ee aos = 
3 a Boll 5 ql Yall 0. el, 
so that 
(9.56) JJ act) 2+ || V u(t) |!2=constant=|[url|?+!| V uol|2. tel. 


This equation expresses the fact that the roral energy of the system (9.54) is 
conserved if the applied force f=0. 

For the numerical solution of (9.55) suppose the finite element space 
ViCHg(@) is given, and let us first formulate the following semi discrete 
analogue: Find u}(t)eV such that for te(0, T) 

(h(t), v)+a(uh(t), Y=(f). vy) We Vn, 
u(0)=u0n. U(0)=uin. 


(9.57) 


where upon. Uin€ Vn are approximations of the initial data uy and uj. This 
problem is equivalent to the following system of ordinary differential 
equations (using the notation of Section 8.2): Find €(t)eR™ such that 


BE+AE=F, tel, 
&(0)=0p, &(0)=0, 


where 0) and 6; are the coordinates of ugh and ujp with respect to the basis 
{p1,..., pu} of Vy. Clearly (9.57) satisfies an energy conservation relation 
analogous to (9.56). 

It now remains to discretize the second order system of ordinary differential 
equations (9.58) with respect to the time variable. To this end it is convenient 
to rewrite this system as follows: 


(9.58) 


Byt+AE=F, tel, 
(9.59) E-n=0, tel, 
€(0)=0p. n(0)=81. 


Now, Ict O<to<ti, . . ., <tn=T be a subdivision of [ with time steps 

kp=tn-typ-1. We may consider the following class of time discretization 

methods for (9.59) (with F=0 for simplicity): Find (6", n"}eRMxR™M, n=0, 
. , such that forn=1,2, ..N, 


B (=) +aAEMtI4(1-a)AEM=O, 
a 


n+lig 
Kn 


= 09, n°=61, 


(9.60) ” (nt 4 (i-yn=0, n=1, 2,05 


where 0 <a, y<1 are parameters. This method ts unconditionally stable for 


a, y= and second order accurate if (a, y)= is » forexample. With (a, y) 


=(0, 1) and a uniform subdivision in time, the scheme coincides with a 
well-known explicit second order centered scheme with no artificial viscosity. 
This particular scheme performs very well if the exact solution is smooth but 
not so if, for example, the initial data up has a jump discontinuity, in which 
Case severe spurious oscillations occur. With (a, y)=(1, 1) we geta first o1der 
accurate implicit method with better stability properties but with heavy 
artificial viscosity. For a scheme similar to (9.60) which has been used 
extensively in applications, sce Problem 9.19. 


212 


Problems 


9.19 


9.20 


9.21 


(9.61) 


Prove a stability estimate for (9.60) in the case= <a<1,- <y<1. 


L 
"2 
Hint: Multiply the equations by (€°*!—&") and B(n®*!—n), respec- 
tively. 


hole 


A class of time discretization methods for (9.59) well-known in the 
engineering literature is given by the so-called Newmark method (with 
F=0), 


BgM I =BEP-+kyBr®—ka BAS" +(5—8) AB"), 
Byt*!=By?—ky( YAR" +(1- YAR), 
where B and y are parameters satisfying 0<B<t, O<Sy<1. Prove that 
this method is unconditionally stable for Bez}. Nate that with B=0 


and ¥=3 and a uniform subdivision, we retrieve the centered scheme 
obtained by taking (a, y)=(0, 1) in (9.60). 


Write the wave equation (9.54) as a first order Friedrichs’ systems (cf 
Example 9.8). 


Consider the method (9.60) with (a, y)=(0, 1) and k,=k, n=1, 2, . 
... N, or equivalently the following centered scheme for (9.57): Find 
U, n=0, 1, 2,..., N such that forn=1,..., N-1, 


(U8t!-2U"4+ Ul"), v)+a(U, ¥)=(f(tn). ¥) VWEVh, 


US supp. U' =U +kuyp. 
Prove that this scheme is conditionally stable under the condition 
k<Ch with C sufficiently small. Hint: Rewrite (9.61) using the notation 
wr=(U"—U""))/k and take v=(W""!+ Wk to give 

(wath, warty Sv une ye=|we, w+ sIVUAP, 


where 


Ke 2 
Iw, w]—-(w, w)— pal), OW, w)=|[wll2=lwll 7, cay 


10. Boundary element methods 


10.1 Introduction 


In this chapter we consider finite element methods or BEM, boundary element 
methods, for some integral equations arising in connection with certain elliptic 
boundary value problems in mechanics and physics (the presentation is based 
on [N], ef also [W]). As an example of such a problem let us consider the 
following exterior Dirichlet problem: 


(10. La) Au=0 in Q’, 

(10. 1b) u=ug on I, u(x) 0 as |x| > &, 

where Q is a bounded simply connected open set in R? with smooth boundary 
[, Q'=R3\Q is the complement of Q= QUT (sec Fig 10.1). Further denotes 


differentiation in the direction n, where n=n(x) is the outward unit normal 
to T at xeI. This notation will be kept throughout this chapter. 


n 


Problems of this type occur in fluid mechanics and acoustic scattering, for 
instance. In the latter case the equation Au=0 is replaced by Au+i?u=0 
where w is a given frequency. Instead of (10.1a) one may also consider 
homogeneous, constant coefficient elasticity or Maxwell equations corre- 
sponding to elastic or electromagnetic scattering. 

Since the domain Q’ is unbounded, we cannot triangulate Q’ using a finite 
number of triangles, and thus we cannot apply the finite element method 
directly to this problem. To get a finite number of elements the first idea is 
simply to replace 2’ by the bounded domain Q§= {xe92’: |x| <b} for some 
suitably large b and use the approximate boundary condition u(x)=( for 
|x|=b. To get reasonable accuracy one may have to choose b quite large, and 
then this procedure may be too costly. 

As we will see below, since the differential equation (10. 1a) is homogencous 
il is possible to reformulate the problem (10.1) as an iategral equation on the 
closed and bounded surface [. Applying a standard Galerkin or finite element 
method to solve this integral equation numerically, we obtain a boundary 
element method. 

We will meet below integral equations of the following types (named atter 
the Swedish mathematician Ivar Fredholm 1866-1927). 


Fredholm equation of the first kind: Given f: T+ R and the kernel! k: | XP 4R 
find q: PR such that 


(10.2) Sk(x.y)atyddy(y)=Mx), xe. 

Fredholm equation of the second kind: Given {: [> R and k: xP > R find 
q: PR such that 

(10.3) q(x) + Sie(x, yalydy(y)=f(x), xe 

Here dy is the clement of surface arca on T and dy(y) indicates integration 


with respect to the variable y. The kernels k(x,y) that we will meet, will be 
weakly singular; more precisely we wil] have 


(10.4) k(xy)= FY, xy, 


Ix-y| 


where ¢(x,y) is a bounded function of x and y. Recall that a singudar kernel 
in two dimensions (TP is two-dimensional) behaves like |x—y|~? as x y. In 
particular, a weakly singular kernel satisfying (10.4) ts integrable, ie, 


[IkGuy)ldyQ<, xeD, 
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If we introduce the notation 


(10.5) Kaq(x)= k(x, yqly)dy(y), xe. 
rt 


then we may say that K is an integral operator; given a function q defined on 
T, a new function Kq is defined on [ by (10.5). With this notation the problems 
(10.2) and (10.3) can be tormulated as 

(10.6) Kq=f (Fredholm first kind), 

(10.7) (I+K)q=f (Fredholm second kind), 


where J is the identity. In the applications below, the kernel k in (10.6) will 
be symmetric, ie, k(x,y)=k(y,x), x, ye, while the kernel k in (10.7) will be 
non-symmetric. 


10.2 Some integral equations 


Let us now briefly recall some of the integral equations that arise in connection 
with various boundary value problems for the Laplace equation. Let us then 
start by recalling that the fundamental solution for the Laplace operator in 
three dimensions is given by the function 


E(x)=— Gaal 


By this we mean that 

{ECA e(Idx= 910). 
for all smooth functions y in R} vanishing outside a bounded set. In other 
words, 

AE=6, 


where 8 is the delta function at 0 (cf Problem 10.2). In particular, we have 
that AE(x)=0 for x#0. Next, let us recall the following representation 
formula: 


Theorem 10.1 If u is smooth in Q and &’ and 
(10.8a) Au=0 in Q and Q’, 
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(10.8b) u(x) =0(|x171). | VuG)/=0(/x]72) as fal, 


then 


A {[ 2] peo aa ) ary] 


ny\ |x—y! 
(10.9) u(x} if x¢l, 
wes uee) if xeY, 


where for xe[ (i=interior, e=exterior), 


ae ee 


=u—ut 
[Pus an én an’ 


ui(x)= lim u(y), uS(x)= lim u(y), 
y>x yox 
ye yen’ 


dul ‘ 
=] 
8n &) pa 


u(xtsn)—u(a) | due (x)= lim 
s on s— 0+ s 


and 2 indicates differentiation in the direction n(y). 
n 
y 


Proof First let x be a given fixed point in Q and define (cf Fig 10.2) 
Q_= {yeQ: ly—xl>e}, 
Se={yeR*: |y-x|=#}, 


where e is so small that $,cQ. 


Fig 10.2 


u(Qvtsn)—u(x) ; 
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1 


Applying Green’s formula on Q, to the two functions u(y) and WY)=Z ; 
Ixy 


which satisfy Au= Av=0 in Q,, we get 
(10.10) 0= fu Avdy— fv Audy= fui dy— f vay 

2, Q, roan roan 

ov du 
~ = dy+ — dy, 

gan H “an! 
where on S, the normal n is directed outwards (see Fig 10.2), Since u is smooth 
in $2 so that a is bounded close to xe $2, and since the area of S, is equal to 


4x, we have 


fe) 
(10.11) | fv “| < cf) - ceoas e-+0. 
3. on 5, 4alx—y| 
Further, for ye5, 
av 1 
y= 


dn : ~ anlx—yl2 : 
and thus 


ov 1 
u— dy=—-—— Jud —u(x) as e> 0. 
f 3 y ina & ee ze 


Hence, letting _ 0 in (10.10) we find, using similar arguments for xeF and 
xe’, that 


u(x) xeQ, 
1 duit ) 1 1 
10.12 - —fu -liw 
(10.12) SL Sr ie tO-2 an, (jeep! HD] -] 3 ww Her 
0 x€EQ’. 


A corresponding result can be obtained by applying Green’s formula on the 
exterior truncated domain 02 = {y¢Q’: |ylSb} and then letting b— « using 
(10.8b). Together with (10.12) this yields the desired representation. O 


Remark The kernel 


| arene ¥ 5 3 1 
is said to be a single layer potential and —— (; ) 
Ix-yl es any \[x—yl 
a double layer potential. LU 


Let us now derive the integral equations that we want to study. 
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10.2.1 An integral equation for an exterior Dirichlet problem 
using a single layer potential 


Let us consider the following exterior Dirichlet problem: 
Au=0 in Q’, 
(10.13) u=ug onl, 
u(x) =0((x(="), [VuGa)!=0(/x| 2), as |x] &. 


One can show that if ug is sufficiently regular (more pieciscly, if ug is the 
restriction to T of some function weH!(R3)), then this problem has a unique 
solution u. We can extend this solution to the interior of Q by letting u satisty 
Au=0 in Q, 
(10.14) 
u=ug on’, 
We know that under the condition an ug just stated, this problem also admits 


a unique solution. 
By the representation formula (10.9) we now have since [u]=0 on [: 


du 


an 


— dy) x4, 
Ixy! 


(20.15) ued= ef 


and since (uit u®)/2=up on [, 


1 ou it : 
=— Jf] —| ——dyly), xel. 
uo(x) me si Rl v(y) 
Thus, writing 
du 
10.16 =| = 
0.16) q=| 2, 
we are led to the following integral equation; Given uy find q such that 
coin bs 8 ayyy=u000. xeP, 
4np |x—y} 


This is a Fredholm integral equation of the first kind with weakly singular 
kernel. Clearly the kernel is symmetric. One can show that for a Jarge class 
of functions up, (10.17) admits a unique solution q. More precisely onc can 
show that if uy€H*(P), then there exists a unique qe HS~!(T) satisfying (10.17). 
Here and below H‘([) denotes the Sobolev space of functions defined on T 


with derivatives of order sin L2(T). With | 2] determined from (10.17), 
n 
we obtain the solution u of (10.13) (and (10.14)) by the formula (10.15). 
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Another way of obtaining the integral equation (10.17) is to start out by 
seeking a solution of the exterior Dirichlet problem (10.13) of the form 


(10.18) u(xy= Ls 9% ayiyy, xeQ" 
4xr |x—yl 
Since A, =| =0 tor x#y, where A, indicates that the derivatives are 
x-y 


taken with respect to the variable x, it is clear that a function u given by (10.18) 
satisfies Au=U in Q’. Now letting xT and using the fact that the right hand 
side of (10.18) is a continuous function of x, we oblain the integral equation 
(10.17) for the unknown density q. 


Remark The function u defined by (10.18) for xeR? may be interpreted for 
instance as the electric potential given by a distribution of electric charges on 
T with density q. or the temperature given by heat sources on IT with intensity 
q. O 


10.2.2 An exterior Dirichlet problem with double layer potential 


When considering the exterior Dirichlet problem (10.13) let us replace (10.14) 
with the following interior Neumann problem: 


Au=0 in Q, 
(10.19) 

au _ dui 

gua ews r, 

an 6n 7 an 


e 
where eae. We recall that a necessary condition for (10.19) to have a 


solution is that 


(10.20) fedy= |“ ay=o, 
V ren 


and further, a solution of (10.19) is unique only up to a constant (if u is a 
solution of (10.19), so is ut efor any constant). Since| | =0 in this case, 
n 


the representation formula (10.9) gives 


eeisreu) +f ful 2 (z jan. xe, 


4mT  OMy\ IK-y 
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so that 


cr(nyee Hod 2) , wie) 1) 


1 


Ix= 


= ¢ 1 = | T, 
in 9) Sal yoyo): xe 


where 
o=[u]=u'-us. 
Thus, since u°(x)=uo(x) for xe’, we are led to the following Fredholm integral 


equation of the second kind: 


1 3 ! YC ¥ = — U(X x 
eh #0) Soc[ Frag) =H. wer 


(10.21) 2) 


By the representation (10.9) we also have for x¢P, 
1 af il } 
22 = S|) + | ayy. 

(10 u(x) ree ly) an bal y(y) 
Note that the right hand side of (10.22) is not a continuous function of x: this 
function has a jump equal to @={u] across T. 

To see that (10.21) is an integral equation with a weakly singular kernel of 
the form (10.4), we observe that for x, yeP, x#y, 


8/1 2 ny) Gey) | 

any\Fe=wll by 

Now, if I’ is smooth, then n(y) is almost orthogonal to (x—y) for x close to 

y (sce Fig 10.3). More precisely, since T is smooth it is easy to show that 
nly): (x—y)=[x-y/?ex, y). 

where ¢(x, y) is bounded function, and thus the kernel in (10.21) satisfies 


(10.4). ae 


Fig 10.3 


1 
1 


One can show that if again ug is the restriction to T of some function we H!(R3) 
and (10.20) holds, then (10.21) admils a solution y which is unique up tu a 
constant. 


Problems 
10.1 Prove that 


2/3/23 
4x7 Any) |x-y| 
0, xeQ’ 
What can be said about the uniqueness of a solution of (10.21)? Q 
10.2 Prove using the technique of the proof of Theorem 10.1 that 


E(x)= ctl, is a fundamental solution for the Laplace operator in R? 


4n|x| 


10.2.3 An exterior Neumann problem with single layer potential 


Let us now consider the exterior Neumann problem 


(10.23a)  Au=0 in 2’, 
(10.23b) Stay on T, 


(10.23c) — u(x)=0(/xI7), | Vucx)|=0(|x!-2) as [x] &. 
One can show that if g is sufficiently smooth (more precisely, ifg= oe for 
n 


some function weH!(R3)), then (10.23) admits a unique solution (note that 
for this exterior problem to have a solution it is not necessary that g satisfies 
(10.20)). With (10.23) we associate the interior Dirichlet problem: 


Au=0 in Q, 
(10.24) 
u=ue on T 


Sinve then [u]=0, we have by the representation formula (10.9) 


(10.25) u(x)= a am. xeR, 
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where | 2] . Now, it can be shown that if u ts given by (10.25), then for 


xer, 


Bue) 99 4 1 pay) 2 (2 
an @) 2 ah) =a may) 10: 


and using (10.23b) we are thus led to the integral equation 


1 1 3 1 
(10.26) 5 q(x)— = Saly) wy] dy(y)=—g(x). xe. 

2D 4aT nx \ [x—y| 
This is again a Fredholm equation of the second kind with non-symmetric 
weakly singular kernel satisfying (10.4). One can show that for any geL2(T) 


there exists a unique solution qeL2(T) of (10.26). 


10.2.4 Alternative integral equation formulations 


It is also possible to use a double layer potential for the exterior Neumann 
problem. Further, if in the representation formula (10.9) we take u=0 in Q, 
then 


au il 
dn |x—yl 


3 
ony 


(10.27) u{x)= xl - i dy(y)+ fu ( mil dty)} for xeQ’ 


where u=u‘ on I, and 


- dy it} pou 9 ! 
(1028) tucy= [281 event fo 2 (a, 


for xeT 


Equation (10.28) gives an integral equation of the second kind for ulr if oui 
in 
is known, and an cquation of the first kind for > Ir if ul) is known. Thus, 
n 
the exterior problems (10.13) and (10.23) can be solved by first solving for 


ulr or ar in (10.28) and then using the representation (10.27). 
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10.3 Finite element methods 
We shall now consider finite element methods for the numerical solution of 


the integral cquations (10.17) and (10.26), that is, the cquations 


(10.29) tf 9M ayyy=uoey, —xer, 
4nTix-yl 
1 l 38 L = 
(10.30) 500) Z fase ( AT] enor= g(x),  -xeP, 


For simplicity we consider only the case of piecewise constant finite element 
approximations. Let Th={Ku, . . .. Ky} be asubdiviston of Tinto “elements” 
Kj (eg “curved” triangles or rectangles, cf Fig 10.4) of diameter at most h. 
We introduce the finite element space 


Whe (veLa(P): v|x, is constant, =I, . . ., M}. 


We will use the basis (1, ., pa} for Wy, where each +p; is equal to one 
on K, and vanishes on K, for j #1. 


Fig 10.4 


£0.3.1 FEM for a Fredholm equation of the first kind 


Multiplying the equation (10.29) by a test function p(x) in some test space W, 
as yet unspecified, and integrating over [, we are led to a variational 
formulation of (10.29) of the following form (cf [NP]): Find qeW such that 


(10.31) -b(q, p)=Ap)— VpeW, 
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where 


bq, Dare ie dy(y)dy(x), 
Ap)= f Uo(X)P(x)dy(x). 


This leads naturally to the following Galerkin method for (10.29); Find qhe W, 
such that 


(10.32) b(q®, pty=l(ph) Vphe Wh. 

Using the basis (1), . . ., ya} this relation can be formulated equivalently 
as the linear system of equations : 
(0.33) BE=/, 


where §=(E1, . . ., Ege RM, 
M 
qh= = Eyyj- 
jel 
and B=(bj), /=(4) with 


(10.34) by= f p HeOGY(s) 


KK, [x-y! 
4=Jugdy, i j=l... .,M. 
x, 


The form b(., .) is bilinear and evidently symmetric. As we shall see, bis also 
Positive definite, (ie, b(p, p)20 with cquality only if ps0). Thus, we may 


define a norm || - ||w by 
(10.35) |Ipilw=b¢p, p)". 


To see that b is positive definite we recall from Section 10.2 that if 


(10.36) v(x)= [PO any), xeR’, 
4nr|x—y| 


then 


(10.37a)  Av=0 in Q and &"', 
(10.37) | | =p and [v]=0 on Fr, 
(10.37) v(x)=O(IKI 79). | V v(x) =0((xi-2) as [xi oe, 
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Using Green’s tormula it follows that 


J (VvPdx= [ Vv. Vvdx+ f Vv: Vv dx 
RS Q ae" 
= i ov 4g — OVE = dy 

ae ya {ves ar= [vpdy, 


since vi=v¢ on T. Thus, multiplying (10.36) by p and integrating over [, we 
have 

1 x 

vip, p= 2 f [POPC ay(ayay(yy= [) 7vPdx. 

4nrr |x- yl Re 
Hence b(p, p)=0, and if b(p. p}=0, then Vv=0 so that v is constant in R3, 
and hence by (10.37b) it follows that p=0. Thus, b is positive definite. 

It now follows immediately that (10.32) admits a unique solution qeWh 

and by our general theory for finite element methods for elliptic problems 
from Chapter 2, we have 


(10.38) Ilq-ahllw<|laq~p"llw Vp Wa. 


It can be shown that the norm || - || is (stightly) weaker than the L2(T)-norm 
(ie, IIpllwSClip|itery and in particular we have WycW), and thus using the 
L2-norm on the right hand side of (10.38) we find by the usual approximation 
theory 


la-a'lhy < Chllqliin)- 


Remark 10.1 For readers familiar with Sobolev spaces, let us mention that 
here W=H7!(T) (cf [Ad]). 


Remark 10.2 One can show that the condition number of the linear system 
(10.33) is O(h-!) if the triangulation Ty is quasi-uniform (this is of course 
related to the fact that |(pl] <b(p, p)<Cllp|f. qr). where the last inequality 
is easily proved). Thus, for realistic choices of h the system (10.33) is quite 
well-conditioned. ‘his is in contrast to some other Fredholm integrat 
equations of the first kind having smooth kernels which may be very 
ill-conditioned and thus difficult to solve numerically. Cl 


Remark 10.3 A targe part of the computationa! work will have to be spent 
on computing the coefficients bj, defined by the double integrals (10.34). If 
the elements K, and K; are not very close, then simple onc-point quadrature 
for each integral in (10.34) may be used. If K; and Kj are very close, then one 
has to be more careful and use special quadrature rules (ct [JS1] and Problem 
10.5). O 
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Remark 10.4 Note that the matrix B in (10.33) is dense; we have bi #0 
Vi, j. Thus, to solve (10.33) by Gaussian elimination requires 0(M3) 
operations. © 


Problem 
10.3. Estimate the number of operations needed to solve (10.33) by the 
conjugate gradient method. 


10.3.2 FEM for a Fredholm equation of the second kind 

Let us now consider the equation (10.30) which we write as 

(10.39) (I-K)q=f, 

where f= 2g, I is the identity operator and K:L9(C)— L2(T) is the integral 


operator defined by 


(10.40) Kas)= 2 fay 2 t Je, xe. 


One can prove that given feL(I) this problem admits a unique solution 
g¢L2(I), and for some constant C independent of f 


(10.41) fiql|=C|If1|=C]|(I-K)q]]. 
where (|- {| denotes the I.2(P)-norm, ie, 


Ipll=(fprdy)"?. 


Let us now consider the following Galerkin method for (10.39): Find qhe Wh 
such that 


(10.42) (q?, p®)-(Kq*. p')=(f, pt) vpte Wa, 
where 
(4. p)= fapay. 
The retation (10.42) is equivalent to the tollowing system of linear equations 
(10.43) (D-B)E=!, 


where 


M 
qh= 2 5iv S=(E1, - . ., EueR™, 


B=(bij), D=(dj) is a diagonal matrix and /=(4,) with 


1 a/{_i 
bg= = J S| —S] dyyydv@),  dj= Say, 
ij alielcal vy)dy(x), dj Hi Y 
h= Jf dy. 
k, 


Again, the matrix B is dense but this time non-synumetric. For the computation 
of the b,j Remark 10.3 again applies. 

Let us analyze (10.42) and then reformulate this equation using the 
following notation, Let Py: L2(T)—> Wp, be the Lz-projection defined by (cf 
Problem 4.8). 


(Pq, p")=(a. p*) vpheWh, qeLa(T). 
By this relation it follows that Pap=p if pe Wh and 
(Paq.p)=(Paq Pap) =(q,Pap) Vp. geLa(l). 
Since (10.42) can be written as 
(q',Pup)—(Kq",Pap)=({Pap) Vp La(r), 
we conclude that 
(q".p)—(PaKq",p)=(Prf.P) VpeL2(P), 
or equivalently, 
(I-Ky)qh=Pat, 


where Ky: L2(Q)— Wy is defined by Ky=PyK. To sum up, the continuous 
problem and its discrete analogue can be formulated as the following 
equations in Lo($2): 


(10.44) (I-K)q=t, 
(10.45) (I-Kpy)q"=Paf, Kp=PaK. 


We now want to prove a stability estimate for (10.45). Once this has been 
done, we obtain uniqueness and hence also existence of a solution to (10.45) 
and we can directly obtain an error estimate. To prove the stability of (10.45) 
we shall use the stability (10.41) of the continuous problem (10.44) and the 
following crucial property of the integral operator K: 


Proposition The operator K defined by (10.40) is srnoothing; morc preciscly 
there is a constant C such that 


(10.46) ||Kp}lingr$CllpllLry, VpeLo(T). 
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Here H!(L) denotes the space of functions defined on T with first derivatives 
in L,(T). This result says that if peL2(f). then Kpell(P), and thus by 
applying the integral operator K, we increase the regularity by one derivative. 
Such a smoothing operator is also said to be compact. 

We will also use the following error estimate for the projection operator 
Py, cf Problem 4.8, 


(10.47) |Ip—Papl!<Chllpllinry. 
Note that since W,, consists of piecewise constants, we have in our case 


Prapl= fp dy dy = mean value of p over K,, i=)... ...M. 


Now, combining (10.46) and (10.47) we find that for peL»(T) 
|(K~Kn)p!/=||(-Pa)Kp}SChI[Kplliz,cr<Chl|pi, 

Thus, 

(10.48) ||(K—Kpy)pi|sChlip|!, ¥peL2(T), 

which may also be written as |[K—KyliCh where |/A|/ denotes the operator 

norm of an operator A: L2(C)-> La(L), ie, TAl/=sup {|]Apj|/|Ip||: peL2(T)}. 

We can now prove the desired stability estimate: 
Lemma 10.1 There are constants C and ho such that for hho and peL2(P) 


IIpI'=C|]— Ky)pll. 


Proof Combining (10.41) and (10.48}, we have 
|Ipl!=C||-K)p|!=Cll(I—Kn)p—(K—Kn)p|| 
SC|(I—Ky)p]] +C{\(K—-Ki) pl 
<Cl|(I—Ky)p]||+Cibl[p||. 

so that 
(1-Cih)l|pli<Cll@—Kw)pll, 


and the lemma follows by choosing, for instance, Ciho=5, 


Using this stability estimate we easily obtain the following error estimate. 


Theorem 10.2 There are constants C and ho such thal if qeH?(L) and q’e Wr, 
are the solutions of (10.44) and (10.45), then for hho, 


(10.49) ||g—qh||<Ch. 


Proof Substracting (10.45) from (10.44) we get 
(Ka) (q-g")=(K—Kn)q + (1- Pre, 
so that by Lemma 10.1 
liq~qh\|SC(/K-Ky aii +||(I- Pal). 
‘The desired estimate now follows from (10.47) and (10.48). D 


Remark 10.5 By Lemma 10.1 it easily follows that the condition number of 
the matrix (D—B)"(D-—) 1s bounded independently of h and thus (10.43) 
can be solved efficiently by, for example, the conjugate gradient method 
applied to the least squares form of (10.43): (D-B)™(D-B)E=(I-B)™ O 


Remark 10.6 In certain applications it is of interest to combine the usual finite 
element method and the boundary element method. This ts the case. for 
instance, if (10.1a) is replaced by the non homogenous equation Au=fin Q’, 
where we assume that the support of fis bounded so that for some b>0, f(x)=0 
for |x|2b. We also assume that Qc{x:|x|<b}. The resulting problem may be 
discretized using a standard finite element method on the bounded domain 
Qh={xeQ’: |x/<b}, together with an integral equation on the surface 
Ty={x: [xl=b} which connects the unknown values of u and the normal 


derivative oe on]. Inthis method finite clements are thus uscd to discrctize 
in 


the bounded domain Q |, where f0, and a boundary integral method is uscd 
to handle the unbounded region {xeQ’: |x/>b} where f=. For more 
information on this topic, see [JN]. O 


Problems 
10.4 Consider the integral operator K:L2(I)— L(I), 1=(0, 1), defined by 


Ka(a)=Jk(x, y)a(y)dy, xel, 


where k(x,y)=1 if yex and k(x,y)=0 if y>x. Prove that 


[Kalla SCilqiitya, ql-2(1). 


10.5 Consider the following integral with weakly singular integrand: 


(10,50) 2) say. 
Kk (x+y) 
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where K is the triangle with vertices (0, 0). (1, 0) and (1, 1) and gis 
a smooth function. Prove that the change of variable y= xz transforms 
the integral (10.50) into 


(10.51) | fee Dana 


This integral has a smooth integrand and may be computed using 
standard numerical quadrature with few quadrature points, whereas 
the same approach for the weakly singular integral (10.50) gives poor 
results. This ‘“‘trick” may be used to compute the elements bj in eg 
(10.34) when Kj and Kj are close (in fact here the integrals with respect 
to x may be replaced by numerical quadrature with quadrature points 
at the nodes of the triangulation which leads to integrals of the form 
(10.50) to be calculated, cf [JS1]). 
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11. Mixed finite element methods 


11.1 Introduction 


In this chapter we briefly discuss so called mixed finite element methods which 
generalize the basic finite element method for elliptic problems described in 
Chapters 1-5. As an important example we shall focus on a mixed finite 
element method for the following Stokes problem in two dimensions (cf 
Section 5.2): Find the velocity u=(u), ua) and the pressure p such that 


(11.1a) —Au+Vp=f in Q, 
(11.1b) div u=0 in Q, 
(11.16¢) u=0 on’, 


where Q is a bounded domain in R? with boundary F and f=(f), f) is given 
(here of course (11.1a) is a vector equation, cf (5.6a)). Note that the pressure 
p is only determined up to a constant; if (u, p) solves (11.1) then (u, p+c) 
also solves (11.1) for any constant c. To obtain a unique pressure, we may 
for example impose the extra condition 


Q1.1d)  fpdx=0. 
Q 
Let us now give a variational formulation of (11.1) which generalizes the 


previous formulation (5.7). We shall seek u and p in the spaces V and H 
defined by 


V=[Hj(Q)7? = (v=(v1,¥2): vie HQ), i=1, 2}, 
IT= (qeL2($2): fqdx=0}. 
Q 
Notice that the velocity space V is here not restricted to divergence-free 
velocities as was the case in (5.7). Now multiplying (1I.la) by veV and 


integrating by parts, and multiplying (11.1b) by qeH, we are lcd to the 
following variational formulation of (11.1): Find (u, p)éeVxH such that 
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(11.2a) (Vu. Vv¥)—-(p.div vy=(f,¥) Wey, 
(11.2b) (q,div u)=0 VqeH, 


where (...) denotes L3-inner products, so that in particular 
2 2 
(Vw. Vv)= = f Vw, Vvidx, (hv= 2 ffividx. 
i=10 ila 


A natural idea to geta discrete analogue of (11.2) is now to replace V and 
H by finite-dimensional subspaces Vp) and Hp. This gives the following 
method: Find (up, pn)€ Vn Hp such that 


(11.3a) (Yup, Vv)— (ph, div v)=(f. v) VWveVh, 
(11.3b) (q, div up,)=0 VWaqeRhn. 


A method for (11.1) of the form (11.3) is called a mixed (finite element) 
method; the term mixed refers to the fact that in (11.3) we seek independent 
approximations of both the velocity u and the pressure p. With Lhe formulation 
(11.3) we do not have to explicitly construct a finite clement space of 
divergence free velocities as in (5.7), something which is difficult to do using 
low order polynomials (cf Section 3.2). Thus, the formulation (11.3) opens 
the possibility of working with velocities that only satisfy the zcro divergence 
condition approximately through the discrete zero divergence condition 
(11.3b). However, we have to pay for this added freedom in the choice of Vy 
by introducing the pressure space Hy (cf Remark 11.1 below) 

In order for (11.3) to be a reasonable discrete analogue of (11.2), the spaces 
Vy and Hy will have to be conveniently chosen; not just any combination will 
work. Loosely speaking, we want to choose V,, and Hp, so that the resulting 
method is both stable and accurate. These demands are in some sense 
conflicting and one has to find a reasonable compromise. Below we will 
consider in detail onc special choice of Vp and Hy for which stability is easily 
proved but which is not optimally accurate. We will also bricfly give some 
methods with improved accuracy but omit the more elaborate proofs needed 
to prove the stability in these cases. Recently, modifications of (11.3) with 
additional stability have been introduced, see Problem 11.3. In these methods 
the spaces V, and Hp can be chosen more independently. 

Let us now return to the discrete Stokes problem (11.3) and consider first 
the stability of this problem. A natural stability inequality for (11.3) would 
be the following: There is a constant such that if (un, ph)€ Vax Mp satisfies 
(11.3), then 


(11.4) {lualli+Ilpal!os=Clifil 1, 
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where, (cf Remark 4.3) 
(fy) 


fllj= s = 2 +1 2 2 
Vfl) sup Tay lh (Ivalléecayt !lvall erga)! 


and |Iqllb=liqlitya). The velocity estimate in (11.4) is obtained easily as 
follows. Taking v=uy in (L1.3a) and q=pp in (11.3b) and adding the resulting 
equations, we get 


unlii=(G un)slifl! -allunli, 
so that 
(11.5) {unt ssC]Hl| -1. 
Next, we want to use the relation (11.3a), ie, 
(11.6) (pn, div v)=(Vuy, Vv)-(f, v) Wwe Vp, 
to conclude that 
(11.7) IIpnllosC}unl!: +Ufi-1), 


which together with (11.5) will prove the desired estimate (11.4). To be able 
to conclude (11.7) from (11.6), we mccd the following estimate: There is a 
positive constant c such that for all qe Hp 


(11.8) sup 4) Sajigi,, 


veVy, Ilyi 1 


Iris clear that by using (11.8) we obtain the pressure control (11.7) from (11.5) 
and (11.6). The inequality (11.8) is called the Bahuska-Brezzi condition (for 
the method (11.3)) and is the crucial inequality that will guarantee stability 
of the mixed method (11.3). Once (11.8) is established one can easily prove 
the following optimal error estimate for (11.3) (cf Problem 11.1 and [Br]): 


(11.9) Ju-ugil+lip—prilosCC inf llu-vlli + int |Ip—allo). 
veVy, qell, 


11.2 Some examples 


We will now consider the problem of constructing the spaces Vp_ and Hj so 
that we will be able to verify (11.8). ‘This is in fact easy to achieve by simply 
taking Vp large enough; the real challenge is to try to choose Vj nearly as small 
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as possible and thus to find a good balance between Vp and Hp. In the first 
example we will have a situation where in fact the space Vj is “too big’. 

For simplicity we will in the examples below assume that Q is a square with 
a uniform subdivision Ty={K} of Q into squares K with side length h. The 
methods to be presented can be generalized directly to quadrilateral elements. 
cf Chapter 12. 


Example 11.1 Let 
(1L.10a) -V_={veV: vige{Q2(K)2, VKeTh}, 
(11.10b)  Hy={qeH: qixeQo(K), WKeT)}. 


In other words, Vy consists of continuous piecewise quadratic velocities and 
Hy of piecewise constants. We will subsequently verify that (11.8) holds in 
this case and thus we have by the error estimate (11.9): 
Ju-unlli+llp—pallossCh(hl alle +{ip! i). 
This cstimatc is not optimal for the velocities, since optimality would require 
second order convergence. [] 
To prove (11.8) with the choice (11.10), we recall (see [GR]) that there is 
a constant C such that for all qéH there exists ve[H}(&)|* such that 
(t.lla) div v=q, 
(11.11b) — IivilyssCligilo. 
Note that this result proves the validity of the following analogue of (11.8) 
for the continuous Stokes problem: 
(11.12) sup (4 BY) 5 Hallo VqeH. 
vev (|vl1 


Now, for a given qeHn, let veV satisfy (11.11) and define vae Vp as the 
following interpolant of v: 


(i1.13a)  vy(P)=0¢P) for P a corner of KeTy. 
(11.13b)  [vnds=fvds for all sides S of T), 

5 $ 
(11.130) vydx=fv dx for all KeTh, 

kK kK 


where Ve Vj is defined by (V(v—¥), Vw)=0 Vwe Vp. It is then easy to see that 
vnllisCllvlli (see Problem 11.4). Using Green’s formula, (11.11a), the fact 
that qeHy is constant on each KeT) and (11.13b), we now have 
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llqile = (q, div v) = © fadiv v dx 
KK 


Ef qviengds== f qvacng ds 
K dK KOK 


i 


= Jqdiv v, dx = (q, div vy), 
: fa n dx = (q ) 
with nx denoting the outward unit normal to 8K, KeT),. But, recalling 
CLL.11b}, we have 
Illy SC}lvlla<Cligilo. 

and thus 

(q. div vi) oe (q, div vp) 

llallo Ivall 


which proves that the stability estimate (11.8) is satisfied in the case 
(11.10). O 


llqlo= 


Example {1.2 The simplest example of a mixed method for Stokes problem 
is probably given by the Q1—Qp method where 


Vn={veV: vx¢[Qi(K)|, VKeT;,}, 
Hy={qeH: ylkeQo(K), VK eTh). 


This method does not satisfy the stability inequality (11,8) since the pressure 
space is “too rich”. Despite this fact it is possible to prove the velocity estimate 
(see [JP]), 


|Ju-unlli<C inf |Ju-vily. 
veV, 
However, the pressurcs pp may not converge to p. To oblain convergence one 


has to filter out some unstable components of the pressure by local smoothing 
(eg averages over groups of four neighbouring squares K, see [JP]). O 


Example 11.3 A good method bascd on square (or quadrilateral) elements 
K is given by 


Vn={veV. vike[Q.(K)P, VKeTh}, 

Hn={qeH: qlkePi(K), WKeTh}. 
This method, the Q2-P; method, satisfies the Stability inequality (11.8) and 
the spaces V, and Hy are well-balanced. Note that the corresponding Q3-Q1 
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method suffers from pressure instabilities similar to those of the Q;-Qo 
method discussed in the previous example. O 


Remark 11.1 introducing bases {@, .. .. @n} and (Wi... .. pm} for Vy and 
Hp, respectively, the discrete problem (11.3) can be written in matrix form 
as 


(11.14a) AS—BO=F, 
(11.14b) RTE=0, 
where A=(a,)), B=(b,), F=(Fj), with 
aij=a( qu.) by=(W, div @;), Fy= ffepjdx, 


and & and 6 are the coordinates of uj, and pp. 

In order for 6 to be uniquely determined from (11.[4a), we clearly need 
to have dim V,2dim II. The same demand comes from (I1.14b) since we 
want the space {veVp: (div v, q)=0 WqeHn} corresponding to {neR*: 
B™=0} to be rich enough to contain a good interpolant of u. 

The system matrix in (11.14) is not positive definite and thus it is not so 
clear how to solve (11.14) iteratively in an efficient way, neither is it clear that 
Gaussiai elimination without pivoting will work, One way out of this difficulty 
is to replace (11.14b) by the perturbed equation 


(11.15) e6+B™E=0, 


with ¢ a small positive constant (ef Problem 11.3). After elimination of 4, this 
leads to the following positive definite symmetric problem in the velocity 
variable E only 


(11.16) (A+ LBBE=F. 


The condition number of this problem increases with decreasing e and may 
require double precision in Gaussian elemination (for accuracy reasons we 
would like to choose eg ¢=O(h?) in the case of Example 11.1}. O 


Mixed methods may be used for problems other than the Stokes equations, 
eg for the elasticity equations, Maxwell's equations and the plate equations. 
in these cases the problem is formulated as a system of equations and the 
different unknowns are independently approximated. For more information 
on mixed methods we refer to [Br], [GR]. 
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Problems 
11.1. Prove the error estimate (11.9) for the method (11,3) under the 
assumption (11.8). Hint: Write (11.3) in the form: Find 


@n=(Un. Pr)€ Pr=VnX Hh 
such that 
Bn, PI=L(y), Vpe Ph, 
where 
B(A, p)=(Vw, Vv)-(r, div v)+(q, div w), 
LOW)=C, v), O=(W, 1), W=(¥, q)- 
Next prove that with ||qpl/=[|[vil}+ Ilqllal!”, 
sup BLY) schiall, vocds, 
ped, | |b 


by choosing for a given @=(w, r), the function p=(w+6z, r), with 
zé¥V}, chosen so that 

~(div z, r)2eclirl!3, jlzllr=[!rllo. 
and 5>0 conveniently chosen (sufficiently small). 

11.2 Determine the matrix equations corresponding to Example 11.2 in 
the case of a uniform subdivision of the unit square and interpret 
the resulting method as a difference method. Verify that the space 
{qeHh: (q. div v)=0, Wve Vy} contains a pressure that alternatively 


takes the values +1 on acheckerboard pattern. [s the solution of (11.3) 
unique in this case? 


11.3 Consider the following (cf (11.15)) perturbed variant of (11.3): Find 
(Un,Pn)€VnX Hh such that 


(Yun, Vv)—-(pn, div v)=(f,v) WvevVa, 
h?(Vq. V ph) +(q.div up)=0 VgeMh, 


where now HyacH'(Q). Prove without using the stability condition 
(11.8) an error estimate for |lu—unlli (cf [BP], [HFB]). 


11.4 Prove that there is a constant C such that if v, € Vy is defined by (11.13), 
then |vpilissCl|v|l;. Note that GeV) is introduced because we cannot 
guarantee that |v(P)|<C}[vili' (a). 


12. Curved elements and 
numerical integration 


12.1 Curved elements 


In our applications of the finite element method so far we have used piecewise 
linear boundary approximations. For example, in the case of a two- 
dimensional region Q we approximated the boundary F of Q with a polygonal 
line. The corresponding error is of the order O(h?) where h as usual is a 
measure of the size of the finite elements. To achieve higher order of 
approximation one may approximate the houndary with piecewise polyno- 
mials of degree k22 and in this case the error due to the boundary 
approximation will be reduced to O(h**?). In a “triangulation” of the region 
Q. the triangles (or quadrilaterals) close to the boundary will then have one 
curved side, sce Fig 12.1. 


al 


Fig 12.1 


A “curved” element may be obtained in principle as follows: Suppose 
(KK, Px, 2) isa finite clement (cf Section 3.4), where K is the reference triangle 
in the (%1, %2) — plane with vertices at 4'=(0,0), 42=(1,0) and 49=(0.1). For 
simplicity let us suppose that the degrees of freedom are of Lagrange type, 
te, S is a set of function values at certain points 4ieK, i=1,..., m. Let now 
F be a one-to-one mapping of K onto the “curved triangle” K in the (x), x2) 
— plane (sce Fig 10.2) with inverse F~!. 
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Fig 12.2 


We now define 
(12.1) Px= {pi p(x)=P(F- (x). xeK. pePR}. 
(12.2) YK={the values at a'=F(4!), i= 1... .. m). 


It is then easy to realize that (K, Px, 2x) indeed is a finite element. The 
functions pePx are defined through the inverse mapping F-!: KK and the 
polynomial functions p: KR. pePg. If the mapping F=(F), F2) is of the 
same type as the functions in PR, ic, if FiePR, i=1, 2. then the element (K, 
Px, Zk) is said to be of isoparametric type. In general the inverse mapping 
F7-) is not polynomial, unless K is a usual triangle in which case both F and 
F~! are linear, and thus the functions pePx are not polynomials in general. 
We will now study a concrete example in more detail. 


Example 12.} Let K be the reference triangle with vertices at a!. i=1, 2,3 
and let 4!, i=4, 5, 6, denote the mid-points of the sides of K. Further, let 
PR=P2(R) and let © be the values at the nodes a’, i=1, . ., 6 (cf Example 
3,2) and let Ge P2(K). i=1,..., 6. be the corresponding basis functions, so 
that Gi(B)= 84. Suppose now a 1... .,6, are the points in the (x1. X2)-plane 
given by Fig 12.3. In particular, a* and a® arc the mid points of the straight 
edges ala? and ala3, and a) is slightly displaced from the straight line a2a°. 
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x> 


Fig 12.3 


Let us now define a transformation F by 
F(%)= 3 ali, eK, 
and let us write 
K=F(K)={xeR?: x=F(X), &eR}. 


Then we clearly have that F:R > K and F(4)=a), j=1, —_., 6, ie, the points 
aj in the %-plane are mapped onto the points ai in the x-plane. We will now 
consider the following questions: 


(a) Under what conditions is the mapping F:R-—> K one-to-one? 


(b) How can we compute the element stiffness matrix corresponding to the 
curved element (K, Px, ZK) where Px and 2 are defined by (12.1) and 
(12.2)? 


(c) What is the interpolation error using the functions in Pq? 


(d) How can we construct a finite clement space Vj using the element 
(K, Px, Zk)? Is it true that VpcC%(&)? What is the global error? 


(a) When is F one-to-one? 


The mapping F is locally one-to-one in small neighbourhood of each point 
geK if 


(12.3) det J(4)#0, eK, 


Po 
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where 
BF, a 
OR, Oz 
j= 
af OF 
SR Oz 


is the Jacobian of Fand det J is the determinant of J. In general the condition 
(12.3) does not guarantee that the mapping F:K— K is globally one-to-one 
(cf Problem 12.2). In our case, however, the sides of K are mapped in a 
one-to-one fashion onto the sides of K and one can then show that (12.3) 
implies that F:K— K is one-to-one, ie, for each xeéK there is a unique ReK 
such that F(%)=x. To check if det J(%) #0 for eK, we split the transformation 
F in two transformations F and F, 


F(&)=F(F(®). 


according to the following figure: 


1 
3) 34 ae pb! vf be 
Fig 12.4 


Ilere F is the affine mapping that maps the vertices bi=ai, j=1, 2, 3 on the 
vertices ai=(aj, aj), j=1, 2. 3. ie, 


F(y)=By+, 


weal. ahead al 
Be 5 sf PF] al 
ajay a2—@2 a3 


where 
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Since by assumption the points a!, a? and a? are not situated on the same 
straight line, we have that det B40 and F is therefore one-to-one. 
It now remains to analyze the mapping F. We easily see that 


Fi(R) = + dik ho. i= 1, 2, 
where 
dj=4b3—2. 
The Jacobian J corresponding to F is given by 


14+d)%2 dik 
J{x)= 
yk. Ledoky 


from which it follows that 
det J(&)=1+dy%+d2%), 


andthus detJ is linear in &. Therefore detJ is positive in R if det J is positive 
at the vertices 4), j=1, 2, 3. We have 


JOO=1, JO O)=1+d2,  JO.1)=140). 
which proves that det J>0 in K it di>-1, i=1. 2. ie, if 
boot ist, 2. 
4 


We thus conclude that F is one-to-one if b> and a° lie in the shaded areas in 
Fig 12.4, and thus in particular if 45, the mid-point on the straight line a2a3, 
is close enough to a5. Hence the original mapping F 1s one-to-one under the 
same condition. In particular, if the element K with one curved edge is used 
to approximate a smooth boundary, then the distance |a5—A5! (cf Fig 12.1, 
12.4 and Problem 12.2) will be of the order O(hz), where hx as usual is the 
diameter of K. Hence a5 will be close cnough to 4° if hy is sufficiently small, 
and thus we conclude that the mapping F will be one-to-one for sufficiently 
fine triangulations. 


(b) Computation of the element stiffness matrix 
The local basis functions on K are given by 
O()= OF x)), J=1, 6 


where @,, j=1, . . ., 6, is a usual basis for PR=P2(K). If for example the 
underlying differential equation is the Poisson cquation (1.16). thei we base 
to compute the integrals 


(12.4) aS Va 7 ox, j=l 6. 


To this end we note that by the chain rule 


8G _ 9 ee ead Bb, Bk1 _, Bi OR? 
oh = (pF aa SS 
Bx, ax ME OM ae any Ra 3s 
so that 
Vga TVG. 
where J-! is the transposed Jacobian of the mapping F~', 
ah 3% 
Ox) (9x) 
JoT= 
Ox2 = Ox2 


If we now transform the integral in (12.4) to an integral over R using the 
mapping F:K > K, we get, 


ap=JOTV Gd) (TT @) [det J]dz. 


Further, by a simple calculation 
1 


-Ta(j-lyt= Jo. 
: a) det °° 
where 
oF _aFs 
OR2 ok 
Jo- ’ 
oF: Fi 
SR OR 


so that finally 
» * dk 

i) ak= [ (loV >) -UoVO) ——. 
(12.5) ¥ {0 $)- WoO) Gal 
Thus the matrix element ak can be computed by evaluating an integral over 
the reference element K. We see that the integrand is a rational function r(%) 
(ie, 1(8)=p(8)/q(X) with p and q polynomials), and thus in general it is difficult 
to evaluate the integral (12.5) analytically. In practice this integral is most 
conveniently cvaluated approximately by some appropriate quadrature for- 
mula, as discussed below. 
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(c) The interpolation error 


Given a function v on K, we define the interpolant mvéP, in the usual way 
by requiring that xv(a')=v(a'), i=1,.. .. 6. If K is our usual triangle we have 
from Chapter 4 


(12.6) Ilv¥-avlieag S$ Ch IivitaiKy, OSssrs3. 


The estimate also holds for a curved triangle K if K is not “too curved"’; more 
precisely if |a5—a5|=0(h%) with a5 and 4° the points of Fig 12.4. Thus, in a 
typical application where the curved elements approximate a smvoth bound- 
ary curve, then the estimate (12.6) will hold. 


(d) The corresponding space Vy, 


Let Ty={K} be a triangulation of Q using the finite element (K, Px. 2x) where 
the “triangles” K may have one or more curved edges. Let 9) be the union 
of the elements in Ty. Then Q) is an approximation of @ with piecewise 
quadratic boundary (see Fig 12.1). We now define in the usual way the finite 
element space 


Vn= (ve HX Qh): vixePK. KeTh}. 


It is then easy to sec that if v|KeP« for KeT), and v is continuvus at the node 
points of Th. then v is continuous across all element edges and thus veH1(Qy). 
Thus, we may choose the values at the node points as global degrces of 
freedom. If we use this space to discretize, for example, the Poisson cquation 
(1.16), then we have the following error estimates 


27a) (tu-ualla'ea,ySCh?fullria), 
(12.76) llu-unlli,ay<Ch4iullw~ay). 2 


12,2 Numerical integration (quadrature) 


We have seen above that the elements in the element stiffness matrix for a 
curved element contain integrals that may be difficult to evaluate exactly. We 
meet the same difficulty in the case of nonlinear problems or differential 
equations with variable coefficients. For example in the heat equation of 
Example 2.7 the elements of the clement stiffness matrix are 


2 ag, 9@ 
12.8 ak=( = k, D 4 
(12.8) iy {2 ml) axe x, 


where km(x) is the heat conductivity in the xm-direction at the point x. 
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To evaluate such integrals in practice one would use a suitably chosen 
numerical quadrature formula of the form: 


(12.9) Jfyde~ ¥ f(y))w, 
K j=] 


where the wj, j=l... .. q, are certain weights and the yi are certain points 
in the clement K. 

To estimate the error committed in using the quadrature formula (12.9) we 
check for which polynomials p the formula (12.9) is exact, ie, for which 
polynomials p we have 


Jpdx= E ply!) 
K j=] 


If (12.9) is exact for peP,(K), then the quadrature crror can be cstimated as 
follows (if r>0): 


0 
(12.9) [ teaaes Stel ane rae) f|Detfax. 

Let us now give some simple quadrature formulas. Here r indicates the 
maximal degree of the polynomials for which the formula is exact. Further, 
ai (i=1, 2, 3) are the vertices of the triangle K, bi (j=1, 2, 3) denote the mid 
points of the sides of K and a!?3 the center of gravity of K. By Q we denote 
a rectangle with sides parallel to the coordinate axis of lengths 2h, and 2h2 
and centered at the origin. We then have, for example. the following 
quadrature formulas: 


[fdx~f(a!?3) arca(K) /\ r=1 
K 

[tax $ (bi) ES) ra r=2 
K I= 


ete 2 “reat 


fit z (f(a) ———- + (bi) 


+f(al23) 9 ment) /» r=3 


fof eB oS 


+f[- ut a. Dt] rs3 


Vi VB)! 4 
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Remark Let us return to Example 12.1 where we considered isoparametric 
quadratics. One can show that if the integrals (12.4) are computed with a 
quadrature formula which is exact tor polynomials of degree r=2. then the 
error estimate (12.6) holds, that is |u—unl|H'¢<2,)=0(h?). Further, to compute 
the integrals (12.8) using quadrature one should use quadrature formulas 
which are exact for polynomials of degrec 2r—2 when using piecewise 
polynomials g,¢P(K). 0 


Problems 

12.1 Consider the mapping F:K-+K. where F(r.@)=(rsin @.rcos 8), 
K={(r, 6): [Srs2, 0S@<2n}, K={xeR*: 1s|xl<2}. Show that the 
Jacobian of F is different from zero in K and that F:K— K is not 
one-to-one. This shows that the condition that the Jacobian is non-zero 
is not sufficient to guarantee that a mapping is globally one-to-one. 


12.2 Let T be a circle with diameter d and let I, be a polygonal 
approximation of [ with vertices on T and with maximal side length 
equal toh. Prove that the maximal distance from Ito [y is of the order 
h2/4d. 


12.3 Let R be the unit square with corners a#=1,...,4, let PK=Q,(R) and 
let E be the degrees of freedom corresponding to the values at the 4’. 
Prove that if K is a convex quadrilateral, then we may define an 
isoparametric finite element (K, Px, 2x) by (12.1) and (12.2). This 
finite element is frequently used in applications. 
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13. Some non-linear problems 


13.1 Introduction 


In this chapter we consider some applications of finite clement methods to 
non-linear problems in continuum mechanics. We will just indicate some 
aspects of this extremely rich problem area. We first consider a class of convex 
minimization problems generalizing the quadratic minimization problems 
studied in Chapters 1-7. These problems correspond tu non-linear elliptic 
partial differential equations and so called variational incqualities. We will 
then discuss a non-linear parabolic problem modelling eg heat conduction 
with heat conduction coefficient and heat production term depending on the 
temperature. The finite element methods of Chapters | to 8 may be directly 
extended to these problems. Finally we consider extensions of the streamline 
diffusion method of Chapter 9 to the Eules and Navier-Stokes equations for 
an incompressible fluid, and to a model problem for compressible flow, 
Burgers’ equation. 

In all cases the discrete problems obtained after application of a finite 
element method, consist of non-linear systems of equations to he solved. We 
also comment on some iterative methods of Newton type for the numerical 
solution of these systems. 


13.2 Convex minimization problems 
13.2.1 The continuous problem 


We have seen that many linear stationary problems in mechanics and physics 
may be formulated as minimization problems of the form 


(13.1) Min Fly), 
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where V is a Hilbert space and I:V— R is a quadratic functional. We will 
now briefly discuss generalizations of (13.1) to convex functionals F that are 
related to non-linear elliptic problems. To formulate these generalizations we 
need the following definitions (as before V is a Hilbert space with scalar 
product (.,.)v and norm ||-{ly). A set KEV is said to be convex if for all v. 
weK and 0<a1, one has 


(13.2) av+(1—-a)jweKk. 


The condition (13.2) states that if v. wéK, then all points on the straight line 
between v and w also belong to K, ef Fig 13.1. 


Fig 13.1 


We say that a functional F:K > R defined on the convex set K is convex if 
for all v, weK and O<a<1, une has (cf Fig 13.2) 
(13.3) F(av+(1-a)w) $ aF(v)+ (1-@)F(w). 


The functional F is said 10 be strictly convex if equality holds in (13.3) only 
for a=0 or a@=1. 


aF(v) + (1-a) F(w) WB -— - ---—- -3 
Flav + (1-a) w) 


I 
I 
. * 
v av+ (1-a)w w 


Fig 13.2 
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Example 13.2 A linear functional is evidently convex. The quadratic func- 
tional Feat IIv|[2.. veV, is strictly convex (cf Problem 13.4), U 


We further say that a set K<V is closed if xjeK and ||xj—x)- y—> 0 as j > 
imply that xeK. Finally we say that a functional F:K 3 R is continuous it xeK 
and |[x;-x|}y—> 0 as j> «©, where xeK, imply that F(x;) > F(x) as j> . 

We consider minimization problems of the form 


(13.4) Min F(v), 
veK 


where KcV is a closed convex set and F:K—>R is convex and continuous. 
If K#V, then (13.4) is a constrained minimization problem; we then seek to 
minimize F(v) under the side condition ve K. If K=V we have an unconstrai- 
ned minimization problem. Problems of the form (13.4) are related to 
variational inequalities, see [DL], [GLT]. 

Let us now formulate a general result concerning existence and uniqueness 
of solutions to problems of the form (13.4). 


Theorem 13.1 Suppose K is a non-empty closed and convex set in the Hilbert 
space V and that F:K-»R is convex and continuous. Suppose that K is 
bounded, ie there is constant C such that |[v|lv<C, VveK, or that F(v)—> & 
as ||vllv—> x. Then there exists a ueK such that 


F(u)=Min F(v). 
veK 
If F is strictly convex, then u is uniquely determined. 


We do not prove this cesult here. For a (short) proof we reter to [ET]. We 
now give some examples of problems in mechanics and physics that may be 
formulated in the form (13.4). 


Example 13.2 Our standard problem (2.4) from Chapter 2 has the form (13.4) 
with K=¥V and 


FW)=} a(v, »)-L(), 


where a(.,.) and L(.) satisfy the conditions (i)—(iv) of Section 2.1. Here F 
is strictly convex and continuous and by (2.2) and (2.3), 


F( 


#SiIMB—Allsiv=lbily|Flviv-A ) 
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as |lv|ly—> ©. ‘Thus, the assumptions of Theorem 13.4 are satisfied and 
existence and uniqueness of a solution of (13.4) follows in this case. Note in 
particular that Theorem 2.1 is essentially a special case of Theorem 13.1. 0 


Example 13.3 Let V=H}(Q) where Q is a bounded domain in R? with 
boundary I’. Let ye H!(Q) be a given tunction defined on 92 such that y(x) <0 
for xe and define 


K=(veH{(Q): v2 in 2). 


Clearly K is convex and one can also show that K is closed. Further let F be 
defined as in Example 13.2 with a and L given by Example 2.3. Then the 
assumptions of Theorem 13.1 are satisfied. The unique solution ueK of (13.4) 
in this case represents e g the deflection of a membrane fixed at its boundary 
under the presence ot an obstacle given by the function wp, ‘The side condition 
uéK, ic, uy in 2, corresponds to the fact that the membrane cannot 
penetrate the obstacle cf Fig 13.3. For more examples of similar nature, see 
[DL], [GLY]. 


said 


Fig 13.3 


kxample 13.4 Consider the non-linear elliptic problem 


3 {yy au) a 2) Bu) _e. 
(35) 0 = (vv) - 2 {var Ze] f in Q, 


u=0 onT, 


where Q2 is a two-dimensional bounded domain with boundary [ andv:R— R 
is a given positive fuaction. This problem is obtained, for example, from the 
following Maxwell's equations modelling a two-dimensional magnetic field 
problem: 


V xH=j, 
(13.6) B=y(IB))H, 
div B=0, 
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where H=(Hj, Hp. 0) is the magnetic tield, B=(R,, Bz, 0) is the magnetic 
flux density, 4 is the magnetic permeability assumed to depend on {Bl? and 
j=(0, 0, -f) is a piven electric current density. The problems (13.5) and (13.6) 
are connected through the magnetic vector potential A=(0. 0. u) related to 
B by B= V XA, so that in particular |B/?=|Vul?. and through the relation 


“5 Now suppose that 


3 
(3.7) W=FP=0'8), B20. 
Then (13.5) corresponds to the minimization problem 
(13.8) Min F(v) 
veV 
where 
(13.9) F(v)=1 fg PvP \dx— ftv dx, 
20 Q 
V=HYQ). 
To see this formally, note that if u is a solution of (13.9), then for any veV 
2(0)<g(e)=F(u+ev) WeeR, 
so that g’(0)=0 which gives 
fordvubyve- Vv=sivdx  Wev. 
Q 


But, using (13.7), this is a weak formulation of (13.5). Suppose now eg that 
v is non-decreasing and for some positive constants vo and vy) we have 
wsv(6)<v1, £20. Then @ is convex and it is easy to see that F given by (13.9) 
is convex, continuous and that F(v)> @ as ||v|lv>«. O 


Example 13.5 Consider the minimal surface problem 


(13.10) Min FO) 


vog on 
where 


Fv)= f(1+/VvP2)2 dx, 
Q 


where Q is a bounded domain in the plane with boundary and g isa smooth 
function. Here F(v) represents the area of the surface given by the graph of 
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the function v. he solution of (13.10) corresponds to a soap film spanned 
by the graph of the boundary function g, cf Fig 13.4. 


Fig 13.4 


Let us now check if Theorem 13.1 may be used to prove existence of a solution 
of (13.10). It is natural to start with V=H!(Q) and K={veH!(Q): v=gon 
T}. We easily check that F: K— R is convex and continuous but we cannot, 
however guarantee that F(v)—> © if ||vi|vy— . cf Problem 13.5. To be able 
to prove existence of a solution of (13.10), using a variant of Theorem of 13.1, 
the space V has tu be chosen larger than H!(Q), (basically, V would be the 
space of functions on Q whose first derivatives are integrable or more 
precisely, the functions on Q of bounded variation cf [T2]). 0 


Example 13.6 A problem similar to (13.10) occurs as a model for the 
displacement of a body made of an elastic, perfectly plastic material under 
a load f. In two dimensions this problem takes the form (13.4) with now 


F(v)= J (i Vv(x)|)ds— fv dx, 
Q Q 


r if O<r<1, 


o(r)= 

scr if r>1. 

2 
Again we need to take V larger than Hj(Q) to obtain existence in general (cf 
Example 13.5). Also f has to be small enough to avoid having 
inf F(vj=—, which corresponds to collapse of the elasto-plastic body and 
non-existence of a solution, cf [T2|. JS2]. 0 
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Example 13.7 An alternative formulation of the elasto-plastic problem of the 
previous example. using stresses instcad of displacements as unknowns, is as 
follows (cf Problem 2.9): 


a th 
Min= 2. 
min bal 
where 
K=HyNPcH, 
Hr={qeH: divq+f=0 in Q}, 
H= (q=(q1.q2): gi€Lo(2)} =[La( QP. 
P={qeH: |q(x)[S1, xeQ}. 
Hlal?=SlaQpPax. 
Q 


One can easily show that K is closed in H and the existence of a solution follows 
from Theorem 13.4 if K is non-empty. ‘The latter condition will be satisfied 
if again the load f is below the collapse load. O 


13.2.2 Discretizations 


Suppose now we have a convex minimization problem of the form (13.4). A 
discrete analogue of this problem is obtained by replacing K with Kx=KO Vj, 
where Vj) is a finite-dimensional subspace of V. This leads to the finite- 
dimensional minimization problem: 

(3.11) Min F(v). 


veK, 
In Example 13.3 above we may eg choose 
Kp={veVy: v= in Q}, 


where VicH) is a standard finite element space of piecewise linear functions 
on a triangulation Tp. Introducing as usual a basis {p). ..., Pm} for Vp, the 
problem (13.11) may be written as a convex minimization problem in RM of 
the form 


(13.12) Mia fn) 


where QeR™ is a closed convex set and f:RM-+R is convex. If 
Q=RM(QERY) then (13.12) is an unconstrained (constrained) convex mini- 
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mization problem in RM. We note that the typical problem (7.1) of Chapter 


7 has the form (13.12) with Q=R™ and f quadratic, (m= “An—b-y. 


13.2.3 Numerical methods for convex minimization problems 


Let us very briefly indicate some methods for the numerical solution of convex 
minimization problems of the form (13.12), We shall then use the notation 
of Section 7.1 with in particular f’ and f” the gradient and Hessian of f (which 
we assume to exist). Let us then first consider the unconstrained case with 
Q=R®™. In this case it is easy Lo show that EeR™ is a solution of (13.12), ie, 
£(&)<f(q) VneR®, if and only if 


(13.13) f(§)=0. 


Further, if f is strictly convex, corresponding to {"(n) being pasitive definite 
for all neR™, then the solution § is uniquely determined. 
For the numerical solution of (13.12) with Q=R™ we shall consider, as in 
Chapter 7, iterative methods of the form 
EktIath+apdk, k=O, 1... 


where as earlier a, is a step length, dé is a search direction and &° is an initial 
approximation. We shall first give some examples of methods with dk= 
—Hyf'(&*), where Hy is an MXM matrix. As in Chapter 7 we sce that dis, 
a descent direction if Hy is positive definite. 


(a) The gradient method with optimal step lengih 

In this method one chooses, of course, 
dk=—-('(E*), ay= af, 

where 


£(E¥+ a0P! dk) =min £(&k+ adk), 
az0 


As in the special case of a quadratic functional one can prove that the rate 
of convergence of this method is inversely proportional to the condition 
number of the Hessian f"(&). 


(b) Newtons method 

In this method we have 
dk=—£"(ES)—14°(E%), 
ax=l, 


Note that in this case one has to solve a linear system of equations with 
coefficient matrix f"(Ek) at each step. If f"() is non-singular (thus in particular 
if £°(&) is positive definite) and the third derivatives of f are bounded, then 
Newton's method will converge quadratically (that is very quickly) in a 
neighbourhood of &. In this case there is a 6>0 and a constant C such that 
it /E*—§&/<0, then 


let Be clet—EP. 


The main problem with Newton’s method is to get a sufficiently good initial 
approximation. Once this is achieved one gets the solution with very few 
iterations. To get such an initial approximation &* one may have to choose 
o<1 to start with. In this case the method is a damped Newton method. 


(c) Quast-Newton methods 
These methods are variants of Newton's method of the form 
dk =—Hyf’(&*). ay =ahPt, 
where Hy is an MXM matrix which may be viewed as an approximation of 
"(E*)~!. In the simplest case one may choose 
Hy=f"(E2)71, k-0, 1, - 
which corresponds to the classical modified Newton's method, or one may take 
Hy=C7!, k=0, 1, 


where C=E/E is an approximation of eg {"(&°) with E sparse, in which case 
we get preconditioned variants of the gradient method, cf Section 7.4. Ina 
true quasi-Newton method the matrices H, are successively updated in a 
simple way using the fact that the difference f’(k)—f' (Ek!) gives information 
about f£’(&*) and then the II, become better approximations of f’(E)~! as k 
increases. The quasi-Newton methods are very efficient on large classes of 
problems (see eg [MS]). 
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(d) Generalized conjugate gradient (Fletcher-Reeves) 
This method rcads: 

ek ‘ TEKS ck, 

=a, 

dk+1=— pk+148 gk 
B= git] + ght! 

gk. gk 
where gk=f"(E*) and d!=~p9. Usually a restart with dk=—gk ig made in this 
method at certain intervals, eg every M& step. 
Finally, let us just mention that to solve the problem (13.12) in the case 

Q#R™, ic, the case of a constrained minimization problem, one may use 


different methods from nonlinear programming such as, for example, penalty 


methods, projected gradient methods, Lagrange multiplier methods, duality 
etc. see eg [G], [GLT]. 


13.3 A non-linear parabolic problem 


Let us consider the following non-linear parabolic problem: 


3 ‘i é 

Spodiv (a(u) Vu)=i(u) in QxI, 
(13.12) us0 on Px], 

u(. .0)=ug, 


where a:R—R is a given function satisfying ap¥a(r)Sa), reR for some 
positive constants a; and f:R-+R is given. This problem mudels heat 
conduction with heat conduction coefficient a and heat production f depend- 
ing on the unknown u. Systems of equations of the form (13.12) also model 
eg chemical reactions. 

A weak formulation of (13.12) reads as follows: Find u(t)e V=Hy(Q), tel, 
such that 


(a(t), v)+a(u(t); u(t), Y=(f(U(D), vy) Wve, tel, 


(13.13) 
u(O)=u, 
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where 
a(u; w, v)=Ja(u) VwVv dx, (v, w)= Jvw dx. 
Q Q 


Diserctizing (13.13) by extending the backward Euler method (8.29), we get 
the following discrete analogue of (13.13): Find Une Vy, n=1. 2, ..,N. such 
that for n=1,...,N, 


(13.14) (U9-U"!", v)-+kpa(U"; U", y)=knl(U), ¥) WevVn, 


where U9=uo. 

Under the assumption that f and a are globally Lipschitz continuous (i ¢ for 
some constant C, |f(r)—f(s)|<C|r—s| and similarly for a) the error estimate 
(8.42) may be generalized to (13.14) (see [EJ1]). Also the discontinuous 
Galerkin method (8.35) may directly be extended to (13.13) (ef Problem 13.2). 


13.4 The incompressible Euler equations 
13.4.1 The continuous problem 


Let Q be a simply connected bounded domain in R? with boundary [. Let 
us recall the Euler equations for an incompressible inviscid fluid enclosed in 
Q: Given g and uy find the velocity u=(u1, U2) and the pressure p such that 


(13.15a)tytue Vut Pag, in Qx1, i=1, 2, 
Xi 

(13.15b) div u=0 in QX1, 

(13.15e) uu n=O on Fx], 

(15.15d)_ u=ug in Q for t=0, 


where as usual n=n(x) is the outward unit normal to Tat xe and 1=(0, T). 
It is known that (see eg [K]) if g, uo and T are smooth and div ug—0 and 
ug’ n=0 on I, then (13.15) admits a unique smooth solution for any T The 
problem (13.15) is an example of a nonlinear hyperbolic problem. Note that 
the boundary condition u- n=0 states that no fluid particles enter or leave the 
domain 2. 

In this section we will briefly consider two possible ways of discretizing 
(13.15) using streamline diffusion finite clement methods. The first method 
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is based on a reformulation of (13.15) using the vorticily w and stream function 
p as variables while the second method is based on the velocity-pressure 
formulation (13.15). 

We begin by recalling (cf Section 5.2) that since & is simply connected and 
div u=0 in Q there is a unique stream function \p(x, t) such that for tel 


a-roty= (2, - 28) in Q, 
3x2 Bx] 
w=0 onl. 


Alternatively w may be specified as the unique solution of the Poisson 
equation: 


—Ay(.,t)=o(.,t) in Q, tel. 


(13. 6a) 
wl .N=0 onT, tel, 
where 
w=rot u= 282381 
Oxy 3x2 


is the vorticity of the velocity field u. Applying now the opcrator rot just 
defined to (13.15a). we obtain the following reformulation of (13.15): 
Find w: 2xI]—R such that 
o+u(o): Vo=ft i 
(13. L6b) ” hago 
wW=0y in Q for t=0, 


where f=rot g, wo=rot up and u(w)=rot wy. where p satisfies (13.16a). We 
see that (13.16b) formally has the form (9.3) with a coefficient B=u(@) 
depending on the unknown solution w. Notice that we do not have to specify 
any boundary conditions for w in (13.16b) since by (13.15c), u-n=0 on T. 

We shall now indicate how to extend the streamline diffusion and the 
discontinuous Galerkin method of Chapter 9 to the nonlinear hyperbolic 
problem (13.16). 


13.4.2 The streamline diffusion method in (w, w)-formulation 


Let 0=tg<t). . .<tw=T be a quasi-uniform subdivision of I into intervals 
Im=(tm-1, tm) of size h and introduce the “slabs” Sq=Q2XIm. Let further 
Tn={t} be a quasi-uniform finite element triangulation of Q with elements 
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+ also of size h, and introduce for a given integer r=1 and m=1,....N the 
finite-dimensional spaces 


(13.17) U™=(veH!(Saq): v[ePA(t)XPr(Im), VK=tXImn, TET}, 


ie, U™ consists of continuous functions detined on S,, that are piecewise 
polynomial in x and polynomial in t of degree at most r. We also introduce 
the spaces 


(13.18) WM=(peH!|(Sq): plxePrai(t)*Py(Im) 
VK=1XIm, TET, and p=0 on PxIn}, 


ie, ™ consists of continuous functions on Sj, that are piecewise polynomial 
in x of degree c+1 and polynomial in t of degree r. We shall further use the 
following notation analogous to that of Section 9.9: 


(w, v)™= J wv dxdt. 


Sn 


<w,v>™= f w(x, tm) ¥(X, tm) dx, 
Q 


va(x, th= lim v(x, t+s), IvyJ=v;-v 
so 


The streamline diffusion method for (13.16) can now be formulated as 
follows: Find (w™, p™)eU™x P™, m=1,..., N, such that form=1, ..,N, 


43.19a) (@™+U"(p™)- Yo™, v+h(v+umapry- Vv))™ 
+<[o™], vy. >™ l= (f, vth(V+um(y™)- Vv ve, 
(13.19b) (Vp™, Vq)y™=(0", @)™ — Vpe B™, 


where @” =o for t=0 and u™(p™)=rot y™. Here and below we also use the 
convention that w"=™! for t=tm-1. 


13.4.3 The discontinuous Galerkin method in (w, 1p)-formulation 


Let B:Q—>R? be a direction field such that B-n is continuous across 
intcrelement sides of the triangulation Tp with normal directions n, We define 
for each K=tXIm, TET, with boundary 3K: 


9K-(B)= (Cx, theOK: n(x, t)- B(x, t)+ni(x, t)<O}, 
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where (n(x, t), n(x, t)) denotes the outward unit normal to dK at (x, edK. 
Further let us introduce for m=1,.  ., N, the space 


W™={velo(Sm): viKeP (1) XP (Im), WK=tXIm. teTh}, 


ie, W™ consists of possibly discontinuous functions on Sj, that are piecewise 
polynomial in x and polynomial in t of degree at most r. Let us also for 
(x, t)€9K_(B) introduce the notation 


vi(x, D= lim v(x-tsn -B. t+s), [vJ=v4—-¥-. 


We can now formulate the discontinuous Galerkin method for (13.16) as 
follows: Find (o™, y™eW"xW™ m=1,  .,N, such that form=1,. ..N 


ZX {f(@™+B-Vomvdxdt+ Ff  [w™]}vitn- B+nlds}=0 
K K 3K (B) 


Vvew™, 
13.20 (Vy™, Yq) =(w™, gr Voewr 
yn, 


where B=rot p™, w° =o for t=0 and we sum over all K=1tXIq with t€Tp. 
. é 5 , ay”. 7 
Note that since p™ is continuous in x, B- aa is continuous across inter- 
aK 


element boundaries S of Th, where 2 denotes differentiation along S. 
5 


13.4.4 The streamline diffusion method in (u, p)-formulation 
For m=1,..., N, we introduce the velocity space 

V™={v: v=rot @, peW™}, 
and the pressure space Q™=U™ where V™ and U™ are given by (13.18) and 
(13.17). We observe that the functions v in V™ satisfy div v=0 in Q, v-n=0 
on T and v-n is continuous across interelement boundaries. 


We can now formulate the following streamline diffusion method for 
(13.15): Find (u™, p™eV™xQ™, m=1,...,N, such that for m=1,...,N, 


(13.21) Ef f[(a™+B- Vur+ Vp™, v-th(v+p. Vv+ Vq))dxdt 
K K 


+ f ({u™),v,)in- B+nlds} = f(g, vth(W+B- Vvt Vy))dxdt 
8K_(p) s, 


Viv, qheV™xQ™, 


26) 


where B=u"™, u®=upo for t=0 and as above we sum over K=tXIm. TET. 


Further, (...) denotes the scalar product in R?. Note that although the 
velocities ve V™ satisfy the incompressibility condition div v=0, the pressure 
is sull present in the formulation (13.21), cf (5.7). Note also that choosing v=0 
in (13.21) gives the following discrete Poisson equation, with Neumann 
boundary condition, for the pressure p” in terms of u'" 


(Vp™, Vq)= = (g—u"—B. Yum, Yq) WqeQ™, 


which corresponds to the following equation obtained by applying the 
divergence operator to (13.15a) 


— Ap=—div(g-u-u- Vu) in &, 
a 2 
oP 2 (gj-dj—-us Vuj)n; on’, 
on ol 


For the methods (13.19)-(13.21) one can prove global error estimates of order 


1 
o(h'”2), see [JS]. 


13.5 The incompressible Navier-Stokes equations 


The extension of the Euler equations (13.15) to the case of a viscous fluid with 
viscosity >0, ie the Navier-Stokes equations for an incompressible fluid, 
reads as follows: Given g and uy find the velocity u and pressure p such that 


(13.22a) titus Vat SP Aung, in QXI, ist, 2, 
13.22b) div u=0) in QXI, 

( ) 

(13.22c) u-n=u-s=0 on FX, 
(13.224) u=ug in Q for t=0, 


where s is a tangential direction to [’. We note that the boundary condition 
u-n=0 in (13.15) is here supplemented by the no-slip condition u-s=0 on 
T requiring the tangential velocity to be zero on T. 

The Navier-Stokes equations (13.22) are an example of a non-linear mixed 
hyperbolic-parabolic system with nonlinear hyperbolic convection terms 
u> Yu, and a linear elliptic viscous terms —pA uj. We will here be interested 
in the case of small viscosity 1. in which case we meet the difficulties in 
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numerical approximation discussed in Chapter 9. If wis not small, then (13.22) 
is dominated by the linear viscous term and our earlicr methods for Stokes 
problem may be directly extended to (13.22), see [GR], [Tl]. 

To extend the streamline diffusion method of the previous subsection to 
the Navier-Stokes equation (13.22) with « small we shall introduce the 
vorticity as an additional unknown. This is needed because the discrete 
velocities in the velocity space V™ are not necessarily continuous in x (the 
tangential velocities may be discontinuous across inter-element boundaries) 
and thus it is not clear how to handle the viscous term Au. We note that if 
o@=rot u then since div u=0, we have 


We now formulate the following streamline diffusion method for (13.22): Find 
(u™, p®, w™MeV"™xQmxQm, m=1,.  .,N, such that form=1,....N, 


(13.23a) £ { f(G™+P- Vu™+ Vp, vto(wtB- Vv+ Vq))dxdt 
K K 
+ f (fu™]. vijin-B+nids+ny f(rot wo™, v+5(V+B- Vvt+ Vq}dxdt 
K (B) K 
= f(g, vr8(V+B- Vvt Vq))dxdt Viv, qvev™xQn™, 
Sa 


(13.23b) (rot @. u™)™=(co™, @)™ — VOEQ™, 


where 8=Ch with C€ a sufficiently small positive constant and as above B=u™ 
and u® =up for t=0. Note that (13.23b) gives a discrete formulation of the 
relation w=rot u together with the no-slip boundary condition u-s=0 on T. 

The method (13.23) is robust, accurate, uniformly stable for 0<p<h and 
suitable for complicated flows. In Example 13.8 below we present some results 
obtained using this method with r=1, ic, piecewise linear velocity, pressure 
and vorticity. For an analysis of (13.23). see [JS]. Numerical results for (13.21) 
and (13.23) are given in [Han]. cf Example 13.8. 


13.6 Compressible flow: Burgers’ equation 


We conchide with an application of the streamline diffusion method to a model 
problem for compressible fluid flow, namely Burgers’ equation: Find u: 
JXI— RK such that 
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3 2 
(3.244) atu - ae = (x, theJxI, 
co x 
(13.24b) (0, 1)=u(1, 1)=0 tel. 
(13.24c) u(x. U)=up(x), xeJ, 


where J=(0, 1), 1-(0, T), 40 and up is a given initial function. [f 1=0 then 
the boundary condition u(0, t)=0 is enforced only if u(Q, 1)20 and u(1, t)=0 
only if u(t, t)<0, corresponding to inflow conditions, Let us use the notation 
of Section 13.4.2 with now Ty= {1} a subdivision of J into subintervals t and 
define 


U™={veU™: v(x, t}=0 tor x=0, 1}. 


The streamline diffusion method for (13.24) with y=0 can now be formulated 
as follows: Find u™e U™,) m=1, ..,.N, such that form=I....,N, 


(13.28) (am4um = veh(vtum Sy)" <[u™], v,>™-120 Wet, 
x 3. 


where u"=ug for t=0. This method can be directly extended to the case 
O<;<h following Section 9.6. 

One can prove (see [JSz1]) that if the finite element solutions u™ satisfying 
(13.25) stay uniformly bounded as the mesh size tends to zero. then the u™ 
will converge to the (entropy) solution u of (13.24) with »=0. For applications 
of streamline diffusion type methods to the compressible Euler and Navier- 
Stokes equations, see [HFM], [HMM], [HM1}, [HM2}, (JSz2], [Sz]. Mcthods 
of this type hold promise to be the first successful theoretically supported finite 
element methods for compressible flow problems with potentially extensive 
applications. In Example 13.9 below we give some results for (13.25) and 
variant of (13.25) with shock-capturing according to Remark 9.6, see [JSz1]. 


Problems 
13.1 Prove the following stability estimate for (13.25) form=1,....N, 


m 
7 2 
jun, tm)=ll2acyt =, ile") (th) Ilia 


nm jun 
2h & fuera ay (leas = \luoll? ay: 


13.2 Extend the discontinuous Galerkin method (8.35) to (13.13). Prove 
a basic stability estimate. 
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13.3. Defining (U, P) on 2x(0, tw) by Uls,=u™, Pils =p™ m=1,...,N, 
where (u™, p™) satisfies (13.21), prove the following stability estimate 
in the ease g=0: 


UG, tw)-!lPq@yt E{ Sf [Ulm Utnj lds 
K 3K (UW) 
+2h f[U+U + VU+ VPPdx) < |juoll2 gay, 
K 


where we sum over elements K=txX In, teTh, m=], ..,N. 


13.4 Prove that the quadratic functional F(v)= 2 I 12 is convex, of 
Example 13.1. 7 


13.5 Prove with the notation of Example 13.5 that there is a sequence {vi} 
such that F(¥j)<1 and |lv|!y— 2% as ia. 


13.6 Prove formally that the problems of Examples 13.6 and 13.7 are 
equivalent. 


Example 13.8 1n Fig 13.5 below we give for a cavity problem the velocities 
obtained by the method (13.23) after 5, 10 and 15 time sleps with r=1 (ie 
Piecewise linear velocities, pressure and vorticity), 8=h=At, u=10-3, given 
inlet velocity=1 and initial velocity=(0 on a 8X16 mesh 


Example 13.9 In Fig 13.6 below we give the result of applying (13.25) with 
and without shock-capturing and with h=V.1 in a case where the exact solution 
of (13.24) with p=0 consists of a raretaction wave and a shock and J is replaced 
by (—©, ©). The exact solution is represented by the dotted line (see [JSz}]). 
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Fig 13.5 Streamline diffusion method for the incompressible Navier-Stokes equations 
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Fig 13.6 Streamline diffusion method for Burgers equation 


o t=5.1 

ie Fret ce ae -) nh : > 
a. Streamline diffusion method for Burgers equation without shock-capturing, d=h=0.1 
“ (=0.1 _ 1-21 = 1=5.1 
ff a i 
= zea il lay arel Sae ar aaae rae be 8 a Ma as ale a a Ta = am aie ier i oe a ba 2 = 
b. Streamline diffusion method with shock-capturing, 8=8=h=0.1. 
: 1=0.1 2 132.4 4 t=5.1 

; 3 
tH aa eal a a Bhs aa ems ma pies 


c. Streamline diffusion method with shock-capturing, ’—h=0.1, 3=0.25. 
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wea 


T 
ix 


or a = 


d, Lax-Friedsichs’ method. 


t=01 


ste 7 —— 


f. Streamtine diffusion method without shock-capturing and with incorrect choice of &(b=0 and 
&=100 h). 
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